Gene Ndas_1864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1864 
Symbol 
ID9245714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2275920 
End bp2276978 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content75% 
IMG OID 
Productaminotransferase class V 
Protein accessionYP_003679798 
Protein GI297560824 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.126544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.578311 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAG CCGGACGGGC CCGCCTGCGG GCCGCGCAGC AGGAGTTCGC TGCCGAGAAC 
ACCTACCTCA ACACCGCCAC GCACGGCCTG ACGCCCCGGC GGGCGCTACG GGCCCTGGAG
CAGCACACCC GCGACGTGGC CGCGGGCCGG TTCCAGCCGG GTGGGGCAGA CGCGGGCATC
GAGCGCGCCC GCTCCGCCTA CGCGCGCCTG ACCGGAGTGC CCGCCTCCCA CGTGGCCATC
GGCAGCCACG CCTCCCAGTT CGTGGGCACG GTCGCCGCCA GCCTGGCCCC GGGGGCGCGG
GTGCTGACGG CGGCCGACGA GTTCAGCTCG GTCGTCCACC CCTTCATGGC CAGGGCGGAC
GCGGGGATCA GCGTGCGCGA GGTGCCCCTG GCCGACCTGG CCTCGCGGGT GGGGCCGGAC
ACCGACCTGG TGGCCGTCTC GGCCGTGCAG TCCGCCGACG GCGCGATCGC CCCGGTCGAG
GACCTGATCG CGGCCTGCGC CGACCACGGG GCGCGGCTTC TGCTGGACAC CACCCAGGCC
GCCGGGTGGC TGCCGCTGCC CGTGGACCGG GTCGACTACA CGGTGTGCGC GACGTACAAG
TGGCTGCTGG GCCCGCGCGG TTCGGCCCTG CTCACCGGCA CCGAGGAGGC GCTGGCCAAG
CTCTCACCGC TGGCCGCGAA CTGGTACGGG GCGCACGAGC CCTGGAAGTC GCTCTACGGC
GGCCCGCTGC GGCTGGCCGA AGGCGCGCGG CGCCTGGACC TGGCGCCGGT GTGGGGGGCG
TGGGTCGGCC TGGAGGAGAC CCTGTCCCTG GTGGAGGAGG TGGGCGTGGA GGTCATCCGC
GAGCACAATG CCGCCCTGGG AGACCGCTTC CGCGAGGCCG TGGACATGGA GCCCGCCGGG
TCGGCGATCG TGTCGGTCCC GGTGCCCGAG GGCGCGGTGG AGCGCGTGGC CGAGGCGGGG
ATCGTCACGG CGGCCCGCAA CGGGCGGATG CGCGCGGCGT TCCACCTGTA CAACGACGAG
TCCGACGTGG ACCGGCTTGT CAAGGCGCTC AAGGGCTGA
 
Protein sequence
MDEAGRARLR AAQQEFAAEN TYLNTATHGL TPRRALRALE QHTRDVAAGR FQPGGADAGI 
ERARSAYARL TGVPASHVAI GSHASQFVGT VAASLAPGAR VLTAADEFSS VVHPFMARAD
AGISVREVPL ADLASRVGPD TDLVAVSAVQ SADGAIAPVE DLIAACADHG ARLLLDTTQA
AGWLPLPVDR VDYTVCATYK WLLGPRGSAL LTGTEEALAK LSPLAANWYG AHEPWKSLYG
GPLRLAEGAR RLDLAPVWGA WVGLEETLSL VEEVGVEVIR EHNAALGDRF REAVDMEPAG
SAIVSVPVPE GAVERVAEAG IVTAARNGRM RAAFHLYNDE SDVDRLVKAL KG