Gene Ent638_4236 details


Gene Information

Locus tag: Ent638_4236
Symbol:
ID: 5110414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009425
Strand:
Start bp: 48243
End bp: 50198
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 58%
IMG OID: 640480853
Product: von Willebrand factor, type A
Protein accession: YP_001165515
Protein GI: 146284562
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.644313
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA GTTATGGATT CAGCGGCCTG TTGTGCGCGG CGGCATTAAG TTTGCCTGCT 
CACGCCGCTG ACACAAAGCC GCTGCTGCAG GAAGGGAAAC ACACGCTGTT CCAGCGCGTG
CTCACCTATC CCGGTTGCAT GCTGGCCGCC AAAGCGGGCG AAGCGGGTAA AGAACAGCCT
GCGTTCAGCC GCTTTTACGT TTATCAGCGT GAAAAACAGG GCCAGGACGA GTGGTTGCAG
GTCGGGCCGG ACAGTTTCGG CCACGTCTCT GGCTGGATGA AGTCCTCCTG CACCGTGGAC
TGGAAAGTGC AGCTCACGCT GGCCTTTACC AACCCCGCCG GACGCCACCC GATGCTGTTC
TTCAAAGAGA AGGGCGATGT GGAGTCGCTG CTGAACAACG CCAAACCTGC TGCCGCGCTG
GAGCCGATGA TCGCCAGTCT GAATCAGAAA AAGCCGGTGC CACAGGTACT GGCTCGTGAA
CCAGACTACA TGGTCGATCA GCTGAAAAAC TTCTACCTGC TGCCCGTACT GGGTTCTGAC
GATATCTTTA CCGATACCGG TTTCCAGGTA CGGGTGCTGA ACGTGGCTTC GGTCAGTGAG
AATGGCAGCG CCACTACCTC GGCTAAAGCG ACCGACGAGA AGAACATGAT GAAGGGCTTC
TCGGCGTCGG TGGTGTTCGT GATCGACTCG ACTATCTCGA TGGGGCCGTA CATTGATCGC
ACTAAAGAGG CGATCGACAA GATCTACAAA CAGATCGAAA AAGAGCAGTT ACAGGACAAA
GTGAAATTTG GCCTGGTGGC CTACCGTTCC AGCGTGAAGG CGGTGCCTGG GCTGGAATAC
GATGCCAAAA TGTATGTCGA TCCGAACACG GTGAAAGACG GCAAAGATTT CCTCGCCAAA
GTGCATGACC TGAAGCAGGC GACCGTCTCC AGTAGCAAAG TGGATGAAGA CGCCTACGCC
GGGGTGATGA CCGCGCTGGA TAAAGTCGAC TGGACGCAGT TTGGTGCGCG CTATGTGGTG
TTGATCACCG ATGCGGGCGC ACTGGACGGC ACCGACAGCC TATCTTCCAC CCATCTGGGC
GCGGAGCAGG TGCGGCAGGA AGCGGCGTAC CGTGGCGTCG CGCTGTACAC CCTGCACCTG
AAAACACCGG ACGGGAAGAA AAATCACGCC TCCGCCGCTG CGCAATATCA GGAGCTGACG
CTTAATCCGT TCCTGCATAA GCCGCTGTAC TACCCGATCG ATTCCGGCGA TGTGAACAGC
TTCGGCACCA TCGTCGACAG CCTGTCAAAT GCCATTACCG CCCAGATCAA AACCGCCTGG
AGCGGGGAAG AGACCGCAGG CAGCGCCCTG GGCGCCAGCC CGGAATACGC CGGTAAGAAA
GCCGATCCGC TGCTGAGCGA TGCCGAGAAA CTGAGCAAAG CCATGCGTCT GGCGTACTTG
GGTGAAAAGC AGGGTACGCA GGTGCCGCCG GTGTTTAAAT CGTGGATTAG CGATCGCGAT
CTGGTGAATC AGAACCTCCC GGCCACCGAA GTCCGCGTCC TGCTGACCAA AAGCGAGCTG
AGCGATCTGA ACGACGTGAT GAAGAAGATC GTCAATGCCG CCAACGAAGG GATGATCTCG
CCGGATGACA TGTTCGCCAG CCTGCGATCT CTGGCCGCGA CCATGGGCAA CGATCCGAAC
CAGGCAAAAG GTAAAAATGC GACCCGCCTC GGTGAAATGG GCCTTCTGGG CGAGTACATC
GAAAGCCTGC CTTATCTGAG CGAAGTGCTG AGCCTCGATG AAGAGACCTG GAAGAGCTGG
GATGGGCTGG AGCAGGAGCG TTTCATTCGC CGCCTGAACA CCAAACTCAA CTATTACCAG
CGCTACAACG AAGATGCTGA TCGCTGGATC GCGCTGGCGC CTGACAGTGA CCCGCGGGAT
AACGTCTACC CGGTTCCGCT GGAGAACCTG CCCTGA
 
Protein sequence
MKKSYGFSGL LCAAALSLPA HAADTKPLLQ EGKHTLFQRV LTYPGCMLAA KAGEAGKEQP 
AFSRFYVYQR EKQGQDEWLQ VGPDSFGHVS GWMKSSCTVD WKVQLTLAFT NPAGRHPMLF
FKEKGDVESL LNNAKPAAAL EPMIASLNQK KPVPQVLARE PDYMVDQLKN FYLLPVLGSD
DIFTDTGFQV RVLNVASVSE NGSATTSAKA TDEKNMMKGF SASVVFVIDS TISMGPYIDR
TKEAIDKIYK QIEKEQLQDK VKFGLVAYRS SVKAVPGLEY DAKMYVDPNT VKDGKDFLAK
VHDLKQATVS SSKVDEDAYA GVMTALDKVD WTQFGARYVV LITDAGALDG TDSLSSTHLG
AEQVRQEAAY RGVALYTLHL KTPDGKKNHA SAAAQYQELT LNPFLHKPLY YPIDSGDVNS
FGTIVDSLSN AITAQIKTAW SGEETAGSAL GASPEYAGKK ADPLLSDAEK LSKAMRLAYL
GEKQGTQVPP VFKSWISDRD LVNQNLPATE VRVLLTKSEL SDLNDVMKKI VNAANEGMIS
PDDMFASLRS LAATMGNDPN QAKGKNATRL GEMGLLGEYI ESLPYLSEVL SLDEETWKSW
DGLEQERFIR RLNTKLNYYQ RYNEDADRWI ALAPDSDPRD NVYPVPLENL P
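
Consistency check (illustrative)

The gene and protein fields in this record can be cross-checked against each other. The minimal Python sketch below translates the first 60 bases of the gene sequence and compares the result with the first 20 residues of the protein sequence, then confirms that the gene length equals three bases per residue plus one stop codon. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard genetic code (the two differ only in which codons may act as start codons). The names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative and are not part of the source record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: these are the standard assignments, which table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in unchanged.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein,
# copied from the record.
first_60_nt = "ATGAAAAAGA GTTATGGATT CAGCGGCCTG TTGTGCGCGG CGGCATTAAG TTTGCCTGCT"
first_20_aa = "MKKSYGFSGL LCAAALSLPA".replace(" ", "")

# Lengths as stated in the Gene Information section.
gene_length_bp = 1956
protein_length_aa = 651

print("Translation matches:", translate(first_60_nt) == first_20_aa)
print("Lengths consistent:", gene_length_bp == 3 * (protein_length_aa + 1))
```

The same translate function can be applied to the full gene sequence; the final codon TGA is a stop codon, so the result should end at the last proline of the protein sequence.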