Gene Daro_4155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4155 
Symbol 
ID3566616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4452921 
End bp4454468 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content60% 
IMG OID637682627 
ProductNa+/solute symporter 
Protein accessionYP_287351 
Protein GI71909764 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.128874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATCT GGTTCGTCGT TCTCTACCTG CTCATTTCCA TCGGTATCGG GCTGTTCGCC 
GCGACCCGCG TCCATAGCGC CAAGGATTTC GCAGTCGCTG GCCGCCATCT GCCGCTGCCG
GTCGTTACGG CGACCGTGTT TGCCACCTGG TTTGGTGCTG AAGCGGTGTT CGGCGTGTCC
GCCACCTTCG TCAAGGATGG CCTGCGCGGC GTCGTCGCCG ATCCGTTCGG TTCCTCGATG
TGCCTGATCA TTGCTGGCGT CTTTTTCTCG CGCAAGCTCT ACAAGCTCAA CATCCTGACG
CTCGGCGACT ATTTCCGGAT GCGCTACAAC CGGACGGTCG AAGTGCTGAC CACGCTGTGC
ATCGTGGCCA GCTATCTCGG CTGGGTATCG GCCCAGATCA AGGCCCTCGG GCTCGTCTTC
AATGTCGTCA CCAATGACGG CATCAGCCAG ACAGCCGGCA TGATCCTCGG CGCCGCCATC
GTCCTCACCT ACACCACCTT CGGCGGCATG CTCTCGGTCG CCATCCTCGA TTTCGTTCAG
ATGGGCGTCA TCATGGGTGG CATGCTGTTC ATTGCCTGGA TCATTTCCGG CCCGGCCGGC
GGCATCGAAA CGGTGATCCA GCATGCCTCG AGCGCCGGCA AGCTCGACTT TTTCCCGCCG
CCCGACCCAT GGCAGTGGCT GACCTTCCTT GGCGCCTGGA TCACCATGAT GCTCGGTTCA
ATTCCGCAGC AGGACGTCTT CCAGCGGATC ACCTCGGCCA AGAGCCAAAA AATCGCGCTG
TGGGGATCCT TCCTCGGCGC CTCGATCTAC TTCTGCTTCA CTTTCGTGCC GATGTTCATC
GCCTACTCGG CCACGCTGAT CGACCCCGAT CTGTTCAAGG GCCTGCTCGA GACCGACTCG
CAACTGGTTT TGCCAACACT GGTCCTGCAA CACACACCCG TCTTCGCCCA GGCCATTTTC
TTTGGCGCCG TGCTGTCGGC GATCATGAGT TGTTCCTCGG CTACCTTGCT CGCTCCGTCG
GTCGCCTTCT CGGAAAACAT CGTCCGCGGC TTGTTCCCGC ACATGGGTGA TCACGAATTC
CTGCGCGTCA TGCGCGCTTC CATCGTGGTT TTTGCCGGCA TCGTGCTCGG CTTCGCACTG
TATTCCAATG CCAGCATCTT CAAGATGGTG GAAAACGCCT ACAAGATCAC GCTGGCCGGC
GCATTCGTGC CACTTTTCTT TGGCGCTTTC TGGAAACGGG CGACCACGCA AGGGGCGTTG
GCGGCAATCC TCGGTGGCCT GTCATCGTGG ATCCTGGTCG AAGTGCTGGT TAGCGTAAGC
GGCGAGACTG CTGGTCGCGG CGATTATGCT TATGCGCTGG CCAATGCCGG GCAACTGGTA
CCGCCACAAC TGATCGGACT TGGCGTCAGC ATTCTCGGTA TGGTTGCCGG CTCACTGTTG
CCACAATGGG TCGGCCATCC ACTGCCGCAA CAGGACATTC ACGAAGCACT GCGTCATCGC
GCCGCAGCTG AAACTCACCA CGCAACGGAA CACCAGCATC ATCATTAA
 
Protein sequence
MLIWFVVLYL LISIGIGLFA ATRVHSAKDF AVAGRHLPLP VVTATVFATW FGAEAVFGVS 
ATFVKDGLRG VVADPFGSSM CLIIAGVFFS RKLYKLNILT LGDYFRMRYN RTVEVLTTLC
IVASYLGWVS AQIKALGLVF NVVTNDGISQ TAGMILGAAI VLTYTTFGGM LSVAILDFVQ
MGVIMGGMLF IAWIISGPAG GIETVIQHAS SAGKLDFFPP PDPWQWLTFL GAWITMMLGS
IPQQDVFQRI TSAKSQKIAL WGSFLGASIY FCFTFVPMFI AYSATLIDPD LFKGLLETDS
QLVLPTLVLQ HTPVFAQAIF FGAVLSAIMS CSSATLLAPS VAFSENIVRG LFPHMGDHEF
LRVMRASIVV FAGIVLGFAL YSNASIFKMV ENAYKITLAG AFVPLFFGAF WKRATTQGAL
AAILGGLSSW ILVEVLVSVS GETAGRGDYA YALANAGQLV PPQLIGLGVS ILGMVAGSLL
PQWVGHPLPQ QDIHEALRHR AAAETHHATE HQHHH