Gene Lcho_2303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2303 
Symbol 
ID6163555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2500114 
End bp2501724 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content73% 
IMG OID641665073 
Productprotein of unknown function DUF894 DitE 
Protein accessionYP_001791334 
Protein GI171058985 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCAC CCGCCAAGCC GCCCCTGGTC GACCGCCTGC CGCCCACCCT GCAGGCGCTG 
ACGCTGCCGG TGTTCCGGAT GTTGTGGCTG GCCTGGCTGG CCGCCAACCT GACGATGTGG
ATGAACGACG TCGCGGCCGC CTGGCTGATG ACGCAGCTCA CCGACAGCGC GGTGATGGTG
GCGCTGGTGT CGGCGGCGTC GACCCTGCCG GTGTTCCTGC TCGGCATCCC CAGCGGTGCG
CTGGCCGACA TCATCGACCG CCGGCGCTGG TTCGCCGCCA CCCAGCTGTG GGTCGCCAGC
GTGGCGGTGC TGCTGGCGCT GCTGAGCCTG GGCGACGGGC TCGATGCCCA GCTGCTGCTG
GCGCTGACCT TTGCCAACGG CATCGGCCTG GCGATGCGCT GGCCGGTGTT CGCGGCCATC
GTGCCCGACA TCGTGCCGCG CGAGCGGCTG TCGGGGGCGC TGGCGCTGCA GGCGCTGGCG
ATGAACATCT CGCGCGTGGT CGGGCCGATG TTTGCCGGCG CGCTGCTGGC GGCGTCGGGC
AGCACCGCGG TGTTCGTGTT GAACGCGCTG CTGTCGCTGG TGGCGTTTGC GCAGGTGCTG
CGCTGGAAGA GCCCGCCGCG CGTCAGCGCC TTGCCGGGCG AGCGTTTCGT CGGTGCCATG
CGGGTCGGCC TGCAGCATGT GCGGCAGAGC CCGCGCATGA AGGCGGTGCT GGTGCGGGTG
TTCCTGTTCT TCGTGCAGAG CATGGCGCTG ACCGCGCTGC TGCCGCTGGT GGCGCGCCGG
CTCGGCAGCG GGGCCGGCGG CTTCACGCTG CTGGTGTCGT CGATGGGGGT GGGCGCGGTG
GCGGCGGCGC TCACGGTGCC GCAGTTGCGC GAGAAGGTCA CGCGCGACGC CATCGCGCTG
TGGGGCACGC TGATCGTCTC GACCGCCACG CTGGCGGTGG CGTTTGCGCC GGCCTTGTGG
ATCGCCGCGC TGGCGATGGT GGTGGCCGGC GTGGCCTGGA TCAGCACCGC CAACACCATG
ACCATGTCGG CCCAGCTGGC CTTGCCCAAC TGGGTGCGGG CACGTGGCAT GTCGGTCTAT
CAGATGGCCT TGATGGGCGG CTCGGCCGGC GGCGCGGTGC TGTGGGGCCA GGTGGCCGAG
CGCGCCAGCG TGCCGGCGGC GCTGGTGACC GCCGCGGCGC TGGGCCCGCT GGTGCTGCTG
CTGACGCGGC GCCTGAGCCT GGGGGGCGGG CAGGACGAAG ACCTGAGCGC GATGCCGGCC
CACCCGGTGC CGGCGCCGGC CTTCAGCTTC GAGCCCGACC GCGGGCCGGT GATGGTGACG
GTCGAGTACC TGATCGACCC GGCCGACGGC GACGCCTTCC GCGCCGTGAT GCAGGACACC
CGGCGCGCGC GGCTGCGCCA GGGCGCGCTG TCGTGGGGGC TGTTCCGCGA CACGGCGCAG
ACCGGGCGCT ACATCGAGTA TTTCGTCGAC GAGTCCTGGG TCGAGCACCA GCGCCGCATG
GAGCGTTTCA CCGCCGCCGA CATCGGCCTG CGCGACCGCC GCCTGGCCTT CCACCGCGGC
AGCGAGATCC CGCGCGTGAC GCGCTATCTG GCCGAGGATC TCGATGTCTG A
 
Protein sequence
MNPPAKPPLV DRLPPTLQAL TLPVFRMLWL AWLAANLTMW MNDVAAAWLM TQLTDSAVMV 
ALVSAASTLP VFLLGIPSGA LADIIDRRRW FAATQLWVAS VAVLLALLSL GDGLDAQLLL
ALTFANGIGL AMRWPVFAAI VPDIVPRERL SGALALQALA MNISRVVGPM FAGALLAASG
STAVFVLNAL LSLVAFAQVL RWKSPPRVSA LPGERFVGAM RVGLQHVRQS PRMKAVLVRV
FLFFVQSMAL TALLPLVARR LGSGAGGFTL LVSSMGVGAV AAALTVPQLR EKVTRDAIAL
WGTLIVSTAT LAVAFAPALW IAALAMVVAG VAWISTANTM TMSAQLALPN WVRARGMSVY
QMALMGGSAG GAVLWGQVAE RASVPAALVT AAALGPLVLL LTRRLSLGGG QDEDLSAMPA
HPVPAPAFSF EPDRGPVMVT VEYLIDPADG DAFRAVMQDT RRARLRQGAL SWGLFRDTAQ
TGRYIEYFVD ESWVEHQRRM ERFTAADIGL RDRRLAFHRG SEIPRVTRYL AEDLDV