Gene Ndas_2901 details

Gene Information

Locus tag: Ndas_2901
Symbol:
ID: 9246752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3468547
End bp: 3470910
Gene Length: 2364 bp
Protein Length: 787 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Carbonate dehydratase
Protein accession: YP_003680818
Protein GI: 297561844
COG category:
COG ID:
TIGRFAM ID:
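
The start, end, and length fields in this record are consistent with each other, and the check can be sketched in a few lines. This is only an illustration: it assumes the coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3468547
END_BP = 3470910


def gene_length(start: int, end: int) -> int:
    """Length in bp of a range, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")
    print(protein_length(length), "aa")
```

Under those assumptions the results agree with the Gene Length (2364 bp) and Protein Length (787 aa) fields.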


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0652695
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAACG ACGCCCAGGG AGTCACGCCC GAGAGGCCGC CCACGGCCTC ACCCGGGCAA 
CGTTTCGACC GCCACAGGGG GATCGCCGCC GACGTCGGCG CCTCACTGGT GGTCTTCCTG
GTCGCCGTCC CCCTGTCCCT GGGGATCGCC GTCGCCTCCG GCGCCCCCCT CATCGCCGGG
ATCATCGCCG CCGTGGTCGG CGGGATCGTC GCCGGACTCG TCGGCGGGTC CGTGGTCCAG
GTCAGCGGCC CCGCCGCCGG CCTGACCATC ATCGTCGCCG ACCTGGTCAT GACCTACGGA
TGGCGGGTGA CCTGCCTGAT CACCCTGCTG GCCGGGCTGG TCCAGCTCGC CCTCGGCGCC
TTCCGCATCG CCCGGGCCGC CCTGGCCGTC TCCCCCGCCG TGGTGCACGG CATGCTCGCC
GGGGTGGGCG TGACCATCGC CCTGGCCCAG CTGCACGTGG TCCTGGGCGG GGAACCGCAG
AGCTCGGCGG TGGCCAACAT CGCCGACCTG CCCCACCAGA TCGCCAACAA CCACACCCCG
GCCGTCGCGG TCGGCGTCAT CACCATCGCG ATCATGTTCA CCTGGAACAA GCTGCCCTCC
CTCGGGCGGC TGCGGCCCGC CGTGGTGCCC GCCGCGCTGG TCGCCGTGGC CACCGCGACC
CTCATCTCCA CCACGAGCGG CTGGCAGGTG CAGACCGTCG TCCTGCCCGG CTCCTTCGCC
GACGCCTGGA ACGGCCCGAT GCTGCCCGAG GCCGGGCAGT GGGACGGCAT CGCCCTGAGC
GTGGCCGCCG TGGCCATGGT CGCCAGCGTC GAGTCCCTGC TCGCCGCGAT CGCCGTGGAC
CGCATGCACA GCGGCCGCCG GGTGATGCTC AACCGCGAGC TGTGCGGCCA GGGCGCGGCC
AACACCATCA GCGGCGCCCT GGGCGGGCTG CCGGTGGCCG GTGTGATCGT GCGCAGCACC
ACCAACGTGC GCGCCGGGGC GCGCAGCCCG CTCTCGACCA TCCTGCACGG CGTGTGGATC
CTGCTGTTCG TCGCCCTGTT CGCGCACGTG GTCGAGCTGA TCCCGATGCC CGCGCTCGCG
GCGCTGCTGG TGTTCATCGG CGTGCAGATG GTCTCGATCG CCCACCTGCG CGACCTGCGC
CGCCACCACG AGGCCAGCGT CTACCTGGTG ACCCTGTTCG GCGTGGTGTT CCTGGGGCTC
CTGGAGGGCG TGTTCATCGG CTTCGCGCTG GCCATGATCG TCTCCCTGCG CAGGCTCACC
AAGCTGACCG TGACCACCGA GGAACGCGAC GACCGGGTGC ACATCACCGT GCACGGCTCG
CTCACCTTCC TGGGCGTGCC CCGGCTCGCG CACGTGCTGC GCACCGTCCC CTCGGGCTCA
CGGGTCGACC TGGACCTGCA CGTGGACTTC ATGGACCACG CCGCCTTCGA GGCCATCCAC
GCCTGGCGGG TGGACCACGA GCGCACCGGC GGCAGCGTCG ACATCGACGA GGTGCACGAG
AAGTGGTACA CGCGCAGTTC CACCCGGTCG GCGCCCGCCG CCAAGACCGC GCCCGGCGGC
CTGGCCCGCT GGTGGGCCCC CTGGGAGATG CGCGGTGACG GCGACCGCGG GGTGAACGCG
CTGGGCCTGC TGACGGCCGG CGCCCGCGAG TACCACGCCA GCACCACCGA CCGGATGCGG
TCGGTGATGA GCCGCCTGTC GCACGGCCAG AACCCGACCG CGCTGTTCGT CACCTGCGCC
GACTCGCGCG TGGTGCCCAA CCTCATCACC GCGAGCGGGC CCGGCGACCT GTTCACCGTG
CGCAACCTCG GCAACCTGGT GCCGCCGCGG GAGGCCCCCG ACAACGGTTC GACGGGCGCG
GCGATCGAGT ACGCGGTGAA CGTGCTGCGG GTGCCCTCGA TCGTGGTGTG CGGACACTCG
CACTGCGGGG CGATGCAGGC CCTGCTGGAG AAGGCCCACC TGGAGACGGA CGAACAGGCG
TCGCACATGC GCCGCTGGCT GTCACACGGC TCGGAGAGCC TGGCGCGGGT GGGCGAGGAG
TCGGGCGCCC TGTCGGGCCT GCCCACGGCT GAGGCGCTGC GCCGCCTGGC CCAGGCCAAC
GTGGAGGCGC AGATCGGCAA CCTCGCGAGC TACCCGGTGG TCCGCGAACG GGTGGAGGCG
GGCGAGCTGA CGCTGACGGG GATGTACTAC GACCTGGAGA CGGCGAGGGT GCACCTCCTG
GACGCCGAGA GGGGGGAGTT CGTCCCCGTG CAGGGCGTCC AGGACGTGAA CGACCCCGTG
CCCCACCCGA GGACGGATGC GGACCACGGG GATCAGCTGG TGGAGGAGTC CTCGTCGGGC
GCGTCGTCGC GTCCGTCCTG CTGA
 
Protein sequence
MRNDAQGVTP ERPPTASPGQ RFDRHRGIAA DVGASLVVFL VAVPLSLGIA VASGAPLIAG 
IIAAVVGGIV AGLVGGSVVQ VSGPAAGLTI IVADLVMTYG WRVTCLITLL AGLVQLALGA
FRIARAALAV SPAVVHGMLA GVGVTIALAQ LHVVLGGEPQ SSAVANIADL PHQIANNHTP
AVAVGVITIA IMFTWNKLPS LGRLRPAVVP AALVAVATAT LISTTSGWQV QTVVLPGSFA
DAWNGPMLPE AGQWDGIALS VAAVAMVASV ESLLAAIAVD RMHSGRRVML NRELCGQGAA
NTISGALGGL PVAGVIVRST TNVRAGARSP LSTILHGVWI LLFVALFAHV VELIPMPALA
ALLVFIGVQM VSIAHLRDLR RHHEASVYLV TLFGVVFLGL LEGVFIGFAL AMIVSLRRLT
KLTVTTEERD DRVHITVHGS LTFLGVPRLA HVLRTVPSGS RVDLDLHVDF MDHAAFEAIH
AWRVDHERTG GSVDIDEVHE KWYTRSSTRS APAAKTAPGG LARWWAPWEM RGDGDRGVNA
LGLLTAGARE YHASTTDRMR SVMSRLSHGQ NPTALFVTCA DSRVVPNLIT ASGPGDLFTV
RNLGNLVPPR EAPDNGSTGA AIEYAVNVLR VPSIVVCGHS HCGAMQALLE KAHLETDEQA
SHMRRWLSHG SESLARVGEE SGALSGLPTA EALRRLAQAN VEAQIGNLAS YPVVRERVEA
GELTLTGMYY DLETARVHLL DAERGEFVPV QGVQDVNDPV PHPRTDADHG DQLVEESSSG
ASSRPSC
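
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a G+C fraction. The record does not state how its GC content value was calculated, so treating it as a plain (G + C) / length ratio is an assumption, and the helper names are hypothetical.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used as a small example.
    # Paste the full listing here to work with the whole gene.
    blocked = "ATGCGCAACG ACGCCCAGGG"
    seq = join_blocks(blocked)
    print(len(seq), "bp")
    print(f"GC fraction: {gc_fraction(seq):.2f}")
```

The same `join_blocks` helper works for the protein listing, since it only strips whitespace.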