Gene Ndas_3665 details


Gene Information

Locus tag: Ndas_3665
Symbol:
ID: 9247534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4399972
End bp: 4401336
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: citrate/H+ symporter, CitMHS family
Protein accession: YP_003681569
Protein GI: 297562595
COG category:
COG ID:
TIGRFAM ID:
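
The start, end, and length fields in this record are mutually consistent if the coordinates are read as 1-based and inclusive (an assumption; the record does not state its coordinate convention). A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4399972, 4401336)
print(length)                  # 1365
print(protein_length(length))  # 454
```

Both values match the Gene Length and Protein Length fields listed in the record.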


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.393946
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGACCG CCATGGGTTT CGCCACGATC GCCGTGGTGC TCCTGCTGCT GTTGTCCAAC 
CGGGTGGGCG CGGTGGTCGC GCTCGTCGGC GTCCCGGTGG CGGCCGCCTT CGCGCTGGGC
TTCGGCCCCG GCGAGGTGGC CTCCTTCGTC GGCGAGGGCA TCGGCGGCGT GGCCTCCACG
ACCGCGATGT TCGTGTTCGC CATCCTCTAC TTCGGGGTGA TGCGCGACGC GGGGCTGTTC
GCGCCGATCA TCCGCCGCGT GCTCGGGTTC GCCGGGAACA CGCCGGTGAC CGTGGCGGTC
GCGACGGTGG TGCTCGCGAT GGTCGCCCAC CTGGACGGCG CGGGCGCCAC CACCTTCCTC
ATCACCATCC CGGCGATGCT CCCGCTCTAC GACGCCCTGG GCATGCGCAG GGTGGTCCTG
GCCGCGCTGG TCGGGCTGGG CGCGGGGATC ATGAACATGC TGCCGTGGGG CGGGCCCACC
GCCCGGGCCG CCACCGTCCT CGGCGTTCCG GCCAACGAAC TGTGGGCCCC GCTGATCCCG
GCCCAGCTGG CGGGCATGGC CGCCTGCGTC GCCGTCGCCT GGTACCTCGG CCACCGCGAG
CGCGTGCGGC TGGCGGCGGC CGACCCCCTG CCCTCCCCGG TGCCCGTCGG GGGCGGCCGC
CCGGGCGGGA CCGCCGGAGA GGGCGGGGCC GACCCGCTCG CCGCGGGCGG GCAGGAGCCC
GACGCCGACC TGCTGCGCCC GCGCCTGTAC TGGGTGAACG CCGCGCTGAC CGTCGCCGCC
GTCCTCGCCC TGGTCTTCGG GCTGCTCTCC CCCGAACTCG TCTTCATGCT GGCCCTGGTC
GTGGCGCTCG TCGTCAACTA CCCGGGGATG AAGGCCCAGA CCGACCGCGT CAACGCCCAC
GCCTCCGGGG CGATCCTCAT GGCCAGCACC CTGCTCGCCG CGGGCGTGTT CCTGGGCGTC
ATGGACAGCA GCGGCATGAT CGAGGCGATG GGACAGGCCA TGACCGGCGC CATGCCCGGT
TTCCTCGGCC CCGGGATGGC CGCGATCGTC GGTGTCCTGG GCGTGCCCAT GAGCCTGCTC
TTCGGCCCCG ACGCCTACTA CTTCGCCGTC ATGCCGGTGC TGACCGCCGT GGGCGAGGGC
TTCGGCGTCG CCGCCGCCGA CATCGCCCAG GCCTCGATCA TCGGCCAGGA GACCGTCGGC
TTCCCGATCA GCCCGCTGAC CGGCTCCTTC TACCTGCTGG TCGGCCTGGC CGGGGTGCCC
ATCGGCTCCC ACATCCGCTT CCTGCTGCCG TGGGCCTGGC TGGTCAGCCT GGTCGTCCTC
GCGGTCGCCC TCGCCAGCGG GGTCGTCCCG CTCTGGGTCG GCTGA
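
The gene sequence is printed in blocks of 10 bases separated by spaces, so it needs cleaning before any calculation. The sketch below, with hypothetical helper names, strips the whitespace and computes the GC fraction, which can be compared with the GC content field of the record when run on the full sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGCTGACCG CCATGGGTTT CGCCACGATC GCCGTGGTGC TCCTGCTGCT GTTGTCCAAC"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_content(first_line):.0%}")
```

Passing the whole sequence, line breaks included, works the same way because all whitespace is discarded before counting.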
 
Protein sequence
MLTAMGFATI AVVLLLLLSN RVGAVVALVG VPVAAAFALG FGPGEVASFV GEGIGGVAST 
TAMFVFAILY FGVMRDAGLF APIIRRVLGF AGNTPVTVAV ATVVLAMVAH LDGAGATTFL
ITIPAMLPLY DALGMRRVVL AALVGLGAGI MNMLPWGGPT ARAATVLGVP ANELWAPLIP
AQLAGMAACV AVAWYLGHRE RVRLAAADPL PSPVPVGGGR PGGTAGEGGA DPLAAGGQEP
DADLLRPRLY WVNAALTVAA VLALVFGLLS PELVFMLALV VALVVNYPGM KAQTDRVNAH
ASGAILMAST LLAAGVFLGV MDSSGMIEAM GQAMTGAMPG FLGPGMAAIV GVLGVPMSLL
FGPDAYYFAV MPVLTAVGEG FGVAAADIAQ ASIIGQETVG FPISPLTGSF YLLVGLAGVP
IGSHIRFLLP WAWLVSLVVL AVALASGVVP LWVG
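
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 assigns the same amino acid to each codon as the standard genetic code and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG. The following is a minimal pure-Python sketch; `translate` and `CODON_TABLE` are names chosen for illustration.

```python
from itertools import product

# Codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGCTGACCG CCATGGGTTT CGCCACGATC GCCGTGGTGC TCCTGCTGCT GTTGTCCAAC"
print(translate(first_line))  # MLTAMGFATIAVVLLLLLSN
```

The output matches the first 20 residues of the protein sequence above; translating the last line of the gene sequence gives the final residues and stops at the terminal TGA codon.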