Gene TM1040_3225 details

Gene Information

Locus tag: TM1040_3225
Symbol:
ID: 4075367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 222584
End bp: 224017
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 61%
IMG OID: 638004734
Product: von Willebrand factor, type A
Protein accession: YP_611461
Protein GI: 99078203
COG category:
COG ID:
TIGRFAM ID:
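
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a sanity check, not part of the record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is an untranslated stop codon. All variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 222584
end_bp = 224017
gene_length_bp = 1434
protein_length_aa = 477

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the computed span and protein length agree with the reported Gene Length and Protein Length fields.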


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACATA AAATCATGAC AGCCGCCATC TGTGCCGCAC TCGCCGCGCC TGTTGCTGCG 
CAAGACCTGA CCCGCAGTAC TTTGGTGTTG GATGCCTCTG GGTCGATGTG GGGACAGATT
GACGGTGTTG CCAAGATCAC CATCGCTCAG GACGTGATGC AGCACCTTCT AAAGACACTC
CCAGAGAACC AAGAGCTCGG CCTGATGGCC TATGGACATC GGCGCAAAGG CGATTGCAAC
GATATCGAAC AGTTGATCGC CCCTGCCGCA GGATCTCGGC AGGCCATTTC ACAGGCGGTC
ACGCAGATCA GTCCAAAAGG CAAGACACCG CTCTCTGCGG CGGTGATGCA AGCCGCTGAC
GCTTTGCGAT CCAGTGAGGA GAAAGCAACC GTCATCCTCA TCTCGGATGG CGAGGAAACC
TGTGGGCTTG ATCCTTGCGC AGTTGGCGCA GAACTCGAAG CGCGCGGTGT GGACTTTACC
CTGCATGCGA TTGGCTTCGG CATCGCGGAT GACGCAGCGC GCGCGCAGTT GCAGTGCTTG
GCTGAGAACA CCGGAGGTTT CTACCGGGAC GCGTCGAGCG CATCTGAATT GACGGCGGCT
TTGGCACAGG TAGCGGTGAC CACATCCACG CCCGACCCGG CATCGGCTAC TTTGGTAGGG
CCTGCTACGG CACAGGCGGG TCGGACCATC GACATCATCT GGCAGGGCCC TGGCGAGGAA
GGCGACTGGA TCGGGACCTT GGCCCCCGGC GCCGATGTCG GTGCATTTTC CTCACGCATT
GGGGTCGAGC ACGGCAACCC TGCGCCGATG CCAACCCCGC CGCAGGCGGG CACCTACGAG
ATCGTCTATG TGCGCGCCGA AACCGGAGCC GTCCTTGCGC GCGCGCCGCT TGAAGTAACG
CCCATGGTCG CGTCGGTCAC AGCGCCATCT ACGGGTCTCA CGGGGGGGAC TGTGACGGTC
ACATGGCAAG GGCGCGGATC AGAGAAAGAC TTTATCGGCA TCGCCCCAAA GGCCGAGGGT
TTCCCGGCGA CCACCTTCTA TTTCGAGACC ACTGGACCAG AACAGCAGGC TTTGCGCCTG
CCATCGCAGC CGGGCGACTA CGAGATCGTT TTTGTTGCCA CGACCGATCC ATGGACCGTG
CTTGATCGCA CCTCGATCAC ATTGTCAGAT CCGGATTTGT CGCTGAAAGC GCCAAGTCAA
GTGGCTGCAG GCCAGGACTT TGCCGTGTTG TGGAACGGGA TCGCGCCAAA CCCTGCGGAT
TATATCGCGC TTGCCAGGGC CGGAGACGAC CTGCCGCATC TCGTCAGCTA TGCGCATACC
GAAAGCCACC TGACCCGCCT GACCGCGCCA GAGGAGGCCG GTGCCTATGA GTTGCGCTTC
TTCTATGCCG AAGGCGACCG CATCGTGACC GCTCAGCCGA TCATCGTCGA CTGA
 
Protein sequence
MQHKIMTAAI CAALAAPVAA QDLTRSTLVL DASGSMWGQI DGVAKITIAQ DVMQHLLKTL 
PENQELGLMA YGHRRKGDCN DIEQLIAPAA GSRQAISQAV TQISPKGKTP LSAAVMQAAD
ALRSSEEKAT VILISDGEET CGLDPCAVGA ELEARGVDFT LHAIGFGIAD DAARAQLQCL
AENTGGFYRD ASSASELTAA LAQVAVTTST PDPASATLVG PATAQAGRTI DIIWQGPGEE
GDWIGTLAPG ADVGAFSSRI GVEHGNPAPM PTPPQAGTYE IVYVRAETGA VLARAPLEVT
PMVASVTAPS TGLTGGTVTV TWQGRGSEKD FIGIAPKAEG FPATTFYFET TGPEQQALRL
PSQPGDYEIV FVATTDPWTV LDRTSITLSD PDLSLKAPSQ VAAGQDFAVL WNGIAPNPAD
YIALARAGDD LPHLVSYAHT ESHLTRLTAP EEAGAYELRF FYAEGDRIVT AQPIIVD