Gene Ent638_1308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1308 
Symbol 
ID5114271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1436362 
End bp1437690 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content53% 
IMG OID640491495 
ProductPTS system lactose/cellobiose family IIC subunit 
Protein accessionYP_001176040 
Protein GI146310966 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA CAAAAATTAC ACCTGCGATG CAGTCTTTTG TCGATAAGTT TGTTGAGTTC 
TCAGCGCGCC TGGCAAATCA GGTGCACTTA CGTTCTCTGC GCGACGCTTT TGCCACGGTC
ATGCCCATTT TCATTCTCGC CGGACTTGCG GTGCTGGTGA ATAACGTGGT GTTTCCGTGG
GTGCTGGAGG GCGACACGCT TGCCCAGTTT AAAGTGTGGG GCGAGGCGAT CATTAATGGC
ACGCTGAATA TCGCGGCGCT GCTGTTAGCG CCGATGATTG CCTGGTCGCT GGCACGCAAC
AAAGACTTCG ACAATCCGGT CTCGGCAGTG GTTATCGCTG TCAGCAGTTT TATCATTATG
ATGCCGATGC AGTTACAGCT GGTTCCCGTG GGGAGTCAGA CGGCGGTGAA CGTCACGCAG
GTGCTGACGT TCACCAATAT TGGCTCGACG GGGATCTTTG CGGGTGTGCT CATTGGCCTG
CTTTCTACCG AAATTTTTAT CGCCATATCA CGCTTAAAAT CGCTGCATAT TTCGCTGGGA
GAGAACGTGC CACCCGCGGT GAGTAAATCG TTCACCACGC TGATCCCGAC GATTATTACG
CTCTCTCTGT TTGCCGTACT GGCGGCGCTG CTCGCCAACG TGCTGCACAC GGACCTGATT
CACCTGATCA CCACGTTTAT CCAGCAGCCG CTGCGGTTGA TCAATACCAG CCTGCCGGGC
ACGCTTTTTA TCTACAGTTT TGGTAATTTC TTGTTTACGC TGGGGATTCA TCAGTCGGTC
GTGAACAGCG TGATTCTGGA ACCGTTCTTA TTGATCAACA CCAACGAGAA CATGATCGCG
TTCGCTAACG GACAGCCGAT TCCGCACATT ATCAATAATA TCTTTGTCCC TACGTTCGGC
ATGATTGGCG GAACCGGGAG TACCATTTCC CTGTTGATTG CGATTTTTAT CTTTGCACGT
CAAAAGTCAT CAAAGCAGGT GGCGCGGCTG TCGCTGGCGC CGGGATTGTT CAATATCAAC
GAGCCAGTGA TTTTTGGCTT GCCGATCGTG TTCAACCTGC CGCTGATGAT CCCGTTTGTC
CTGCTGCCGG CCATCGGCAT TTATTTTGCC TGGCTCTGTA CCACGCTAGG GCTGATGTCG
CGCTGCGTGG TCATGATCCC ATGGACCACG CCGCCAATAC TCAGCGCCTG GCTGGCGACG
GCGGGGGACT GGCGCGCGGT GGTGGTGCAG TTGGCAATCA TTGTATTTGG TGTATTCTTC
TACCTGCCTT TCCTCAAGAT TGCTGAGCGA GTGGCGTTAA AAAACAGTGT GATAGCGGAT
CAATCATAA
 
Protein sequence
MSDTKITPAM QSFVDKFVEF SARLANQVHL RSLRDAFATV MPIFILAGLA VLVNNVVFPW 
VLEGDTLAQF KVWGEAIING TLNIAALLLA PMIAWSLARN KDFDNPVSAV VIAVSSFIIM
MPMQLQLVPV GSQTAVNVTQ VLTFTNIGST GIFAGVLIGL LSTEIFIAIS RLKSLHISLG
ENVPPAVSKS FTTLIPTIIT LSLFAVLAAL LANVLHTDLI HLITTFIQQP LRLINTSLPG
TLFIYSFGNF LFTLGIHQSV VNSVILEPFL LINTNENMIA FANGQPIPHI INNIFVPTFG
MIGGTGSTIS LLIAIFIFAR QKSSKQVARL SLAPGLFNIN EPVIFGLPIV FNLPLMIPFV
LLPAIGIYFA WLCTTLGLMS RCVVMIPWTT PPILSAWLAT AGDWRAVVVQ LAIIVFGVFF
YLPFLKIAER VALKNSVIAD QS