Gene Tery_1244 details

Gene Information

Locus tag: Tery_1244
Symbol:
ID: 4242171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 1926211
End bp: 1927449
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 35%
IMG OID: 638106454
Product: von Willebrand factor, type A
Protein accession: YP_721065
Protein GI: 113475004
COG category: [R] General function prediction only
COG ID: [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
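
The start, end, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The Python sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1926211, 1927449)
print(length_bp, protein_length(length_bp))  # prints "1239 412"
```

Under these assumptions, 1239 bp corresponds to 413 codons, of which the last is the stop codon, leaving the 412 residues reported for the protein.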


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.427731
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.129489
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTTA ATCTACGTTC GGCTCTCAAT GATACTAATA TTGATGCTTC TCAATCTTTA 
TCTCAACGGC AAGTGGCCTT ATCTATTTCG GTGATGGCTA ATCAGCTAGA AAGGACTGTA
CCATTAAATT TATGTTTAAT TTTAGACCAC AGTGGTTCTA TGGAAGGAAG ACCACTAGAA
ACGGTTAAAC AAGCAGCAGT ACAACTTGTG GAAAAATTGA AGGAAGGCGA TCGCCTTTCG
GTTGTAGCTT TTGACCATCA AGCTCAAGTA ATTGTTCCTA ATCAGATGAT CAATGATTCT
GCTAGTATTA AGGGTAAAAT TAATAAATTA AGAGCCTCTG GTGGTACTGC TATTGATAAA
GGTTTAAAGT TAGGAATAGA AGAATTAAAT AAGGGTAGAA AAGAGTCTAT TTCCCAGGCT
TTTATATTAA CTGATGGAGA AAATGAACAT GGGGATAATG ACCTTTGTCT CAAGCTAGCA
AAGTTAGCAA CAGACTATAA TATTACTCTA AATTCTCTAG GATTTGGTGA TGATTGGAAT
CAAGATGTTT TGGAAAAAAT TGCTGATGCT GGAGGGGGAA ATCTTTCCTA TATTCAACAA
CCAGAACAGG CAATAGAGGA GTTTAGTAAA TTATTTAATC GCATTAAATC TGTAGGAATT
ACTAACTCTT ATTTGCAATT TTATTTAATG CCTAAAGTGA GGTTAGCAGA ACTTAAACCT
ATTGCACAAG TGGCACCAGA TACTATTGAG TTGCCAGTAA AAAAAGAGGG TAATGGGTTT
ATAGTTAGAC TGGGAGATTT AATGAAAGAT ATAGAAAGGG TGGTTTTAGT CAATACTTAT
ATTGGGCAAT TACCAGAAGG AAAACAAGCA ATTGCTCAAT TACAAATTCG TTATGATGAC
CCTGCTCAAA ATCAAGAAGG TTTACTTTCA GAATCAATTT TAGTTGAAGC TAATTTTATG
GAAAAATACC AGCCTCAAGT TAACTCTCAA GTACAAAATC ATATTTTAGC TTTAGCAAAA
TATAGGCAAA CTCAAATAGC TGAAACAAAA TTACAACAGG GTGATAGAGC AGGTGCAGCT
ACAATGTTAC AAACAGCAGC TAAAACAGCA TTACAAATGG GAGATACAGG AGCTGCAACT
GTTTTACAAA CTAGTGCTAC TCGCTTACAA GATGGGGATA AACTTTCAGA AATGGAACGT
AAAAAAACAA GAATTGTTTC CAAAACCATT TTAAAGTAG
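
The GC content figure in the gene information can be recomputed from the gene sequence. The following is a minimal Python sketch, assuming the sequence is pasted as a string with the 10-base blocks separated by whitespace, as it is laid out here; `gc_content` is a hypothetical helper, not part of any database tooling, and how the record rounds its percentage is not stated.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 6 of the 20 bases are G or C.
print(gc_content("ATGAAAGTTA ATCTACGTTC"))  # prints 30.0
```

Passing the full 1239-base sequence to the same function gives a value that can be compared with the listed GC content.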
 
Protein sequence
MKVNLRSALN DTNIDASQSL SQRQVALSIS VMANQLERTV PLNLCLILDH SGSMEGRPLE 
TVKQAAVQLV EKLKEGDRLS VVAFDHQAQV IVPNQMINDS ASIKGKINKL RASGGTAIDK
GLKLGIEELN KGRKESISQA FILTDGENEH GDNDLCLKLA KLATDYNITL NSLGFGDDWN
QDVLEKIADA GGGNLSYIQQ PEQAIEEFSK LFNRIKSVGI TNSYLQFYLM PKVRLAELKP
IAQVAPDTIE LPVKKEGNGF IVRLGDLMKD IERVVLVNTY IGQLPEGKQA IAQLQIRYDD
PAQNQEGLLS ESILVEANFM EKYQPQVNSQ VQNHILALAK YRQTQIAETK LQQGDRAGAA
TMLQTAAKTA LQMGDTGAAT VLQTSATRLQ DGDKLSEMER KKTRIVSKTI LK
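
The protein sequence can be reproduced by translating the gene sequence codon by codon. The sketch below builds the codon table by hand instead of relying on a library; it uses the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The `translate` function and its stop-at-first-stop-codon behaviour are illustrative choices.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
# These assignments are shared by the standard code and translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAAAGTTA ATCTACGTTC GGCTCTCAAT"))  # prints MKVNLRSALN
```

The final nine bases of the gene, TTAAAGTAG, translate to LK followed by a stop codon, matching the end of the protein sequence.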