Gene Elen_3096 details


Gene Information

Locus tag: Elen_3096
Symbol:
ID: 8417432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3600916
End bp: 3602211
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 62%
IMG OID: 645026076
Product: major facilitator superfamily MFS_1
Protein accession: YP_003183427
Protein GI: 257792821
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2223] Nitrate/nitrite transporter
TIGRFAM ID:
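
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the reported 1296 bp) and that the CDS ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Elen_3096.
start_bp, end_bp = 3600916, 3602211

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1296
print(protein_length(length_bp))  # 431
```

Both results agree with the Gene Length and Protein Length fields of the record.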


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.00556654
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCTACCA CTACAGCCAA AAAGAAGGGG ATCCACTTCG CATGGTTCGT CCTCATCGGC 
GTTCTGCTGA TGATGGGTCT GTGCCGTGGC GGCATCAACA GCGGCATGGG CTTGTTCTTC
CAGCCTATCA GCGCCGACAT GCACATCGGC GTGGGCGAAG TGGCAATGAT GTCGTCCATC
TCGGCATTGA TCACTATGTT CTGGAGCCCG TTCGCGGGTC GCCTGCTCGA CAAGTTCGAC
ATCCGCATCA TCACCGTAGT CGCCGTCGCC ATCCAGGCGG GCTGCTTCGC CGCGCTGTCG
CTGGTCGACG CAGTGTGGGG CTTCTACGCG CTGGCCGGCA TCATGGCGTT CGGCTCGGTG
TTCGCCACGC AGCTGGTGGG TCCCATGATG ATCAACCGCT GGTTCAAGGA TAAGAACGGC
CTGGCCATGG GCATCATGAT GAGCTTCGTG GCCATCTGCT CCGCCGTTCT CTCGCCGGTC
GTCGCCTCCA TCATCGCATC GAACGGCTGG CGCATGGGCT ATATCGTGCT GGGCGTGCTC
GCGCTGGTTA TCGTGATCCC GGCCGTGCTC ATCTGGTTCC GCACGCCCGA GCAGAAGGGG
CAGCTGCCCC TCGGCGCCAC CGAGGCCGAC ATCGAAGCCG CCAAGAACGC CGATCCGAAA
GCCGCCGCCG AGGCCGCGAA GAACCTGCCG GGCCTCACGT CCAAGCAAGC GCTGAAGACC
CCCACGTTCT GGTTCTTCTT CATCTTCATG GTGCTGCTCA CCGGAACGCT GGCGTTCGCC
TCCATCGTTC CCACCCTGGC TATCGAAGCT GGTTTCGATA CCGTGACCAG CGGATTCGCT
ATGACGGCGT ACATGATCGG CACGGCTATC GCCGCCGTCG TGTTCGGCAC GATCTCCGAC
AAGCTGGGCC CCCTCAAGGC CACGATGGTG GCGTGCGCGT GCGGTTTCAT CGCCCTCTTG
GGCCTCATCT TCTTCCGCAC GAACCTGTAC ATGTTCTTCG GCTCGCTGTT CTTCTACGGC
TGCCTGTCCG CCACGCTGGG CGTCATCGGC CCGCTGGTGC TAGGCACGCT GTTCGGCCAG
AAGGAATTCG GCTCCATCTA CGGCATCGTC ATGATGGCCA CCGGCATCGG CTCCATGATC
CTCATCCCGG CGTACGGCTT CATCTACGAT GCGACGGGCA GCTTCACGCC CGCCCTCATC
ATGATCTTCT GCTTCATCGT CGTCTGCCTG ATCTCCATGA TCATGGCCTT CAAGACCGGC
AAGAAGGTTC AGGCCATGTG GATGCCGAAG GCGTAA
 
Protein sequence
MATTTAKKKG IHFAWFVLIG VLLMMGLCRG GINSGMGLFF QPISADMHIG VGEVAMMSSI 
SALITMFWSP FAGRLLDKFD IRIITVVAVA IQAGCFAALS LVDAVWGFYA LAGIMAFGSV
FATQLVGPMM INRWFKDKNG LAMGIMMSFV AICSAVLSPV VASIIASNGW RMGYIVLGVL
ALVIVIPAVL IWFRTPEQKG QLPLGATEAD IEAAKNADPK AAAEAAKNLP GLTSKQALKT
PTFWFFFIFM VLLTGTLAFA SIVPTLAIEA GFDTVTSGFA MTAYMIGTAI AAVVFGTISD
KLGPLKATMV ACACGFIALL GLIFFRTNLY MFFGSLFFYG CLSATLGVIG PLVLGTLFGQ
KEFGSIYGIV MMATGIGSMI LIPAYGFIYD ATGSFTPALI MIFCFIVVCL ISMIMAFKTG
KKVQAMWMPK A
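
The gene and protein sequences are printed in space-separated blocks of ten, so they need to be joined before any comparison. The sketch below shows one way to strip the block formatting and translate the gene sequence in pure Python. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs only in its permitted start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid assignments
# are shared by the standard code and the bacterial code (table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCTACCA CTACAGCCAA AAAGAAGGGG ATCCACTTCG CATGGTTCGT CCTCATCGGC"
last_line = "AAGAAGGTTC AGGCCATGTG GATGCCGAAG GCGTAA"

print(translate(first_line))  # MATTTAKKKGIHFAWFVLIG
print(translate(last_line))   # KKVQAMWMPKA
```

The two outputs match the start and end of the protein sequence listed above. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein sequence and the reported GC content.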