Gene Ent638_4241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4241 
Symbol 
ID5110358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009425 
Strand
Start bp58466 
End bp59671 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content53% 
IMG OID640480858 
Productvon Willebrand factor, type A 
Protein accessionYP_001165520 
Protein GI146284567 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.33156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTAGAA TAACAAGGAC TGTTGAGCAC AGCGAATCTG TGCAAAATTA TATCAACCTC 
TACCCTTCCT TTATTTCTCT TGTTGAGCAG CATCTCTCTG CTGGTTATAA AAATTTATTT
GCCAGACCAC ACCAAGCCTC GGATGGTGCG ATTGAGTGGT ACAGCGATAT TAACGGACAA
CCTGTTTCAC TCATCGCGCT TCCACCTGCA GAAAAAGAGC ATGCGCGGGT CTTGTTACAG
CAAAAGCTAG TGGTCATTGA CCAACTTTAT CAAAAGCTCT CGATGAATAA CTCCGCGCCT
GTCGACGTTA TGCAGGTGTT GTCGCTCGCC AGTTGTCAGT CTGATGAGCG CGGCGTCTGG
GTGGTTGATG GTCAGCCAGT CATTACAGCC TGGTGCGAAA GCCCCGCCGT GCCAGTGTCG
CCGGTTGCCA GTAAGGGCCG CCGTTGGTGG TGGTTACTGC TGGCATTGCT CGTGCTGCTG
GCGTTTCTTT TGCTACGCGG TTGCATGCCG ACGGCAAAAT TACCGGTCCA GGGTGCGCCT
GCGCCAGCAC CGCTGCCGAT AAAACAGATC TGTCCGGCAA AACGGACCAA ACAGCAAGCC
CCTGAAATGG TGTTAATTTT CGATGCATCA GGTTCTATGT CCATCAGTAT GGACATTACG
CCGGATGAGC TGCGCCGTCT GATGCAGGAT AGGCCAGTAA AGAATTTTGA TCGCGAACCT
CGCCGTATTA GCCTTGCGCA CCGCTCGGCA AAACAGCTGA TCGACGAAGT GCCAAAAGAC
ATGGATATCA GCCTGGTATC GGCGGCAACC TGCCAGCAAG TCTCTGTGAC GCCAGCCCTT
TCTTTTGCCC AGCGCGATGA ACTGAAATAC GCCATCGATA ACATTCAACC GGTGGGGAAA
ACCGCGCTGG CAGAAGCGCT GGAGAAGGCG GGCAAGCTGG TCGATGGCGT GGATCGTGAT
GCTATCATCG TGCTCATCAC CGACGGCGAA GAGACCTGCG GCGGCGATCC GTGCGTGGTC
GCGCAGCAGC TCAAGCAGCA GAAACCGCGC CTGCAGGTCA ACGTGGTCGA TATTATGAAT
ACCGGTGCCG GAAACTGTAT TGCCAGCCAG ACCGGGGGGT CGGTTTATGC CGTAAACAAT
ACCCATGAAT TTAATGAAAT GATGAACCAG GCAATTAAGG AATATATCCC CGAGGCCTGC
GACTAA
 
Protein sequence
MIRITRTVEH SESVQNYINL YPSFISLVEQ HLSAGYKNLF ARPHQASDGA IEWYSDINGQ 
PVSLIALPPA EKEHARVLLQ QKLVVIDQLY QKLSMNNSAP VDVMQVLSLA SCQSDERGVW
VVDGQPVITA WCESPAVPVS PVASKGRRWW WLLLALLVLL AFLLLRGCMP TAKLPVQGAP
APAPLPIKQI CPAKRTKQQA PEMVLIFDAS GSMSISMDIT PDELRRLMQD RPVKNFDREP
RRISLAHRSA KQLIDEVPKD MDISLVSAAT CQQVSVTPAL SFAQRDELKY AIDNIQPVGK
TALAEALEKA GKLVDGVDRD AIIVLITDGE ETCGGDPCVV AQQLKQQKPR LQVNVVDIMN
TGAGNCIASQ TGGSVYAVNN THEFNEMMNQ AIKEYIPEAC D