Gene Ndas_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1904 
Symbol 
ID9245754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2322483 
End bp2324519 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content67% 
IMG OID 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_003679838 
Protein GI297560864 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.674112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGAAA AACCACATAA AAACAAAGAG CAAACGAGCC GTCGCAGGAA GGAACACAAA 
GACGAGAAAG GCAAGGACCG GGCCTCGGCG CCGCGGAGCC GGACGGGCCA CACCGCGCTC
GGGAGCCTGG ACTACCAGCA CCCCACGTAT CCCCACGACA CACACCCGGT CCTCGTCCCG
GGCATCTCGA TCGACGACCA GCGCCGCAGC TACCGGGTCG ACTGGCTCGT CTTCGCGGTC
GCCGGATCGC TGACGGTCGC CTTCGTCGTC TGGGGCATCT GGTCGCCGGG AAGCGTCGCC
GCGGTCGCCG AGACGGCGTT CTACTGGTCG ACCGACAACC TCGGCTGGAT GTTCAACGTC
GTGGCCATCG TCGTGCTCGT CTCCACCGTG GGCATAGCGT TCTCGCCCTA CGGGAGGATC
CCGCTGGGCA AGGACGGCGA GCGGCCCGAG TTCAGCACGT TCTCCTGGAC GGCCATGCTG
TTCGCCGCCG GGCTGGGTGT GGCGGTCCTG TTCTGGGGGC CCTCGGAGCC GCTCGGCTAC
TTCATCTCAC CGCCTCCGCT GACGAACGAG CCGGAGTCGG TCGAGGCCAT GCACACGGCG
CTCGCCCAGA TGTACTACCA CTGGGGCTTC CACGCGTGGG CCGTCTACGC GCTGGTCGGC
GGGGCCGTCG CCTACGCCGC CTACCGCCGC GGCCGTCCCC TGCTCATGTC CTCGATCTTC
CGCGCCCTGT TCGGCCGACG GCTCACCGAG GGTTTCGCCG GAAAGCTCGT CGACATCTTC
GCGATCATCG CCACGCTCTT CGGCACCGCC GCCGCCCTCG GTATCGCGGC GATGCAGATC
GGCTCCGGCG TCAGCATCGT GTCGGGCGCC GGGGACCTCA CGAACAACAC CCTGGTCGTC
ATCATCGCGG TCCTGACCGT CGGCTTCGTC GTCTCGGCGG TCTCGGGCGT CGCACGGGGC
ATCCGGCTCC TGTCGAACGT GAACATCGTG CTCACGATCG GCATCGTCGC GGTCTTCCTC
TTCCTCGGCC CCACACTGTT CCTCCTGAAC CTGCTGCCCT CGGCGGTCAT GGAGTACTTC
GGCTCCCTGT TCGACATGAT GGGCCGATCG CTCTCGTGGG GTCCCGAGAC GCAGGAGTTC
CAGTCGCTGT GGACCGTCTA CTACTGGGCG TGGTGGATCT CCTGGTCGCC CTTCGTGGGC
ATCTTCCTCG CGCGCATCTC CCGCGGCCGC ACCATCCGCC AGTTCACCCT CGGCACGATC
ATCATCCCGT CCTCGCTGCT CTTCGTCGCC TACGGGGTGA TGGGCGGAAC CTCCATCTGG
ATGTACCGGG AGGGCGCCCC CGGTCTCACC GAGGGCATGC CCGCGCCCGA GGTGCTCTTC
GCCCTCATCG ACAACCTGCC GTACGTGGAG TGGCTGCCCT TCGTCGTGAT CGTGGTGCTC
GCGATCTTCT TCATCACCGC CGCCGACTCC GCGTCGGTGG TGATGGGCAT GCTCACCACG
CAGGGGGACC AGAACCCGCG CCTGTGGGTG GTCGTCTTCT GGGGCCTGGT CATGTCGGGG
ATCGCGATCG TGATGCTGCT TCTGGGCGAC GCGACGGCGT TGACGGGCCT GCAGCAGCTG
GTGATCGTCA CGGCGGTGCC CTTCGCGCTC GTACTGGTCC TCGCCGTCGT CGCGTGGTTC
AGGGAGCTGC GCACCGATCC CCTCACCCTG CGCATGCACT ACATCGACAC GGCGATGGAC
AACGCCGTGA CGGAGGGGGT GGACCGCTAC GGCGACGACT TCTCCCTGAA GGTCGTGGAA
TCGCGGCCCG GGGACGGAGC CGGGGCGGGC ATCGACTCCA CCGACGAGAG CTACACGGAG
TGGTACCAGC GCACCAATGA GGAGGGCGAA CCGGTCGGCT TCGACTTCGG GACCGGGGAG
TGGGCCGACG GCTACGATCC GGACACCGGT GAGACCTCGG AGACCGCCCC CGAAGGGGCC
GTGGTGGCAC GGGAAGACCG CGTCGAGGTG TCGGAGACCG GGACCGAGGA ACGCTGA
 
Protein sequence
MPEKPHKNKE QTSRRRKEHK DEKGKDRASA PRSRTGHTAL GSLDYQHPTY PHDTHPVLVP 
GISIDDQRRS YRVDWLVFAV AGSLTVAFVV WGIWSPGSVA AVAETAFYWS TDNLGWMFNV
VAIVVLVSTV GIAFSPYGRI PLGKDGERPE FSTFSWTAML FAAGLGVAVL FWGPSEPLGY
FISPPPLTNE PESVEAMHTA LAQMYYHWGF HAWAVYALVG GAVAYAAYRR GRPLLMSSIF
RALFGRRLTE GFAGKLVDIF AIIATLFGTA AALGIAAMQI GSGVSIVSGA GDLTNNTLVV
IIAVLTVGFV VSAVSGVARG IRLLSNVNIV LTIGIVAVFL FLGPTLFLLN LLPSAVMEYF
GSLFDMMGRS LSWGPETQEF QSLWTVYYWA WWISWSPFVG IFLARISRGR TIRQFTLGTI
IIPSSLLFVA YGVMGGTSIW MYREGAPGLT EGMPAPEVLF ALIDNLPYVE WLPFVVIVVL
AIFFITAADS ASVVMGMLTT QGDQNPRLWV VVFWGLVMSG IAIVMLLLGD ATALTGLQQL
VIVTAVPFAL VLVLAVVAWF RELRTDPLTL RMHYIDTAMD NAVTEGVDRY GDDFSLKVVE
SRPGDGAGAG IDSTDESYTE WYQRTNEEGE PVGFDFGTGE WADGYDPDTG ETSETAPEGA
VVAREDRVEV SETGTEER