Gene Dfer_3022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_3022 
Symbol 
ID8226596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp3695729 
End bp3698380 
Gene Length2652 bp 
Protein Length883 aa 
Translation table11 
GC content58% 
IMG OID644930853 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003087402 
Protein GI255036781 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only
[S] Function unknown 
COG ID[COG0591] Na+/proline symporter
[COG3055] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR03548] cyclically-permuted mutatrotase family protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.61129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.844759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTGCA ACATCCGACT ATTCCTAATC CTGACCTGCC TGCACCTGAT GTGGCCCGAT 
AACGCATTGG CTAACGACAC GCTCACCAAC CGGATCGACT GGTCGGCGGC GGCCAGCCTT
CCCGCCGCGG CAGGGGAAGA CGCAAACCCC GGTGTGGCCG GTGCGTTTGC AGGCTGGCAT
AACAGTGCCC TGCTCATTGC CGGCGGGGCC AATTTTCCTG CCGGTATGCC CTGGGAAGGT
GGCCGCAAAG CCTACCACGA CCGCATTTAC GTAATGAAAA AAAGCGGCGC GTCCTATCGT
TGGATCGACG TGCGCAATCC GCATTTGAAA CAAAAAACCG CTTACGGGGC CAGTGTATCG
GTCCCAGGCG GCGTGGTGTG CATCGGCGGA GAAAGCGAAG CGGGTGCCAG CCGGAATGCA
TTCCTCATGC AATGGGACGA CGCTGGTAAG CAGGTAGTTT TTAAGCCATT ACCCGACCTG
CCCGTACCGC TCGCCAATGC AGCCGCGACG AGCATTGGTA ACGTGGTGTA CATCGCGGGC
GGGGAAAGCA GTGGTAAGCC TTCGAATGGC TTTTTCGCAT TAGACTTGTC CGAGGCTGAC
CCGCAATGGC AGACGCTGCC TGCGCTGCCG GTCGCATTGT CGCACGCGGT GGCCGTGGCG
CAGAGCGATG GCAAGGCTTC CTGCGTGTAT GTGCTGGGTG GGCGCAGCGC CAGCGCGTCG
GGCGTGAGTG CCTTGCACGG GACCAATTTC CGCTATGATC CTGCCAAACG ACAATGGCAG
CAGCGCGCGG GGATCAGCGA CGGAACGGCA CCGACCACGC TTTCGGCTGG AACGGGCCTC
GCCAGCGGTG CTTCCTATAT TGTGTTGTTC GGAGGGGATA ATGGGAAGGT GTTTAATCGG
ATTGAAACCT ATAATGCCCG CATAGCCGCC GCAACCAACG ACGCAGAGAA GCAAAGGCTC
CAAGCCGAAA AACTGCCGCT GCTGCAACAG CACGTCGGTT TTAGCAGAAA TATTTATTTG
TACAACACCA TTACCGACGC CTGGACCGTC ACCGGCGAGC TGCCGGAAGC CGCGCAGGTG
ACGACCACCG CCGTGAAAGC CGGCGACGAG ATTTTCATCC CGAGCGGGGA GGTGCGGCCC
GGCGTGCGTA CGCCCGTGGT CATGCGCGGG AGCGTGCGGA GCCAATCATC GTTCTCCTGG
ATTGATTCGG CGGTGCTATT CATTTGTTTC CTGCTGATGA CGGCCGGGCG GTTCCTGTTT
ACGGGTAAAA CCAGCAACAC CGACGACTAT TTCAAAGGCG GCGAGCGTAT TCCGCAATGG
GCGGCGGGGA TCAGCATTTT CGGGGCCAAA CTGAGCGCGA TCACCTTCAT GGGCATTCCG
GCCAAGACAT ACGCCACGGA CTGGACGTAC TTTTTCCTGC TGATGACGAT TATCATGGTG
ATGCCGCTCG TGGCGGGCTA TTTCATCCCG TTTTACCGGC GGCTGAACGT CACGTCGGCA
TATGAATACC TGGGCAAGCG CTTCAACACC GGCTCGCGGA TGCTGGCTTC GGCATTGTAC
GTGTTGCTGC AACTAGGGCG AATGGGCATT GTGGTGCTCC TGCCGAGCAT CGCGCTTACG
CTCGTGACGG GCATTGATAT CAACATCTGC ATCATCATGA TCGGCGTGAT CAGCATATTC
TTCACAGTGA AGGGCGGTAT CGAGGCGGTG ATATGGGTGG AGGTGATCCA GGTACTGATC
CTAGCAGCGG GCGCATTGTT CTGCCTGTTT TACCTGCCGT TTCAAATCGG CAACTGGAAT
GCGGCCGCCG ACGTGCTGCA AAATGCCGAA AAGCTGAAAG TATTCGACTT CCGCTTCGAT
TTCACGGAAC CGACTTTCTG GGTGGTGGTG ATCGGCGGGC TGGCGATCAA CCTGCTCACC
TACGGCACCG ACCAAACCAC CGTGCAGCGC TACCTGACGA CCAAATCGGA GAGCGAGTCC
GTGAGGAGCC TCAAACTGGG CGCCTGGCTC ACGTTACCTT CCACGCTGGT ATTCTTTTCA
ATCGGCACGC TGCTGTTTTT GTTCTTTCGC GAACAGCCGG CGTCGGTAAA CATGGCGCTG
GACAATGTCG ATAATATTTT TCCATGGTAC ATTGTGAGCC AGCTACCGGC CGGGCTTTCG
GGCCTGCTTA TCGCCGGTAT TTTCGCCGCC GCCATGAGTA GCACCGAGGC CAGCATGAAC
TCCACCGCCA CCTTGCTCAC CACCGATTTT TACCAAAAAC TATACCCCGG CGTGACGCCG
AAGCAGACCC TGTTCTTCGC CCGCGCGGCC ACGCTGCTGC TCGGGATATT TGTCACCTGC
ATCGCATTAT ACATGGCGCA CAAGGGCGTG TCGTCGCTGT GGGATCGATT CAATACGATT
TTGGGCCTGT TCACAGGCTG CATTGGCGGG GCATTTGTGC TCGGGATATT CACAACCAAA
GCCAGCGGCA ACGGCGTTAT GGCCGGTATG GCGCTCAGCT GTGTCACCCA GCTGCTCATC
CAGCAGTATA CCGACATCCA TTTGCTGATG TACGCATTCA CCGGGCTGGT AAGCTGTGTT
GGATTTGGCT ATGTTTTAAG TTTGCTGATG CCCGCAAAGC GAGACCTGGC GGGTTTAACG
ATCTACGAAT GA
 
Protein sequence
MLCNIRLFLI LTCLHLMWPD NALANDTLTN RIDWSAAASL PAAAGEDANP GVAGAFAGWH 
NSALLIAGGA NFPAGMPWEG GRKAYHDRIY VMKKSGASYR WIDVRNPHLK QKTAYGASVS
VPGGVVCIGG ESEAGASRNA FLMQWDDAGK QVVFKPLPDL PVPLANAAAT SIGNVVYIAG
GESSGKPSNG FFALDLSEAD PQWQTLPALP VALSHAVAVA QSDGKASCVY VLGGRSASAS
GVSALHGTNF RYDPAKRQWQ QRAGISDGTA PTTLSAGTGL ASGASYIVLF GGDNGKVFNR
IETYNARIAA ATNDAEKQRL QAEKLPLLQQ HVGFSRNIYL YNTITDAWTV TGELPEAAQV
TTTAVKAGDE IFIPSGEVRP GVRTPVVMRG SVRSQSSFSW IDSAVLFICF LLMTAGRFLF
TGKTSNTDDY FKGGERIPQW AAGISIFGAK LSAITFMGIP AKTYATDWTY FFLLMTIIMV
MPLVAGYFIP FYRRLNVTSA YEYLGKRFNT GSRMLASALY VLLQLGRMGI VVLLPSIALT
LVTGIDINIC IIMIGVISIF FTVKGGIEAV IWVEVIQVLI LAAGALFCLF YLPFQIGNWN
AAADVLQNAE KLKVFDFRFD FTEPTFWVVV IGGLAINLLT YGTDQTTVQR YLTTKSESES
VRSLKLGAWL TLPSTLVFFS IGTLLFLFFR EQPASVNMAL DNVDNIFPWY IVSQLPAGLS
GLLIAGIFAA AMSSTEASMN STATLLTTDF YQKLYPGVTP KQTLFFARAA TLLLGIFVTC
IALYMAHKGV SSLWDRFNTI LGLFTGCIGG AFVLGIFTTK ASGNGVMAGM ALSCVTQLLI
QQYTDIHLLM YAFTGLVSCV GFGYVLSLLM PAKRDLAGLT IYE