Gene Tery_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2800 
Symbol 
ID4245334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4342197 
End bp4343225 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content38% 
IMG OID638107852 
Productrare lipoprotein A 
Protein accessionYP_722449 
Protein GI113476388 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0797] Lipoproteins 
TIGRFAM ID[TIGR00413] rare lipoprotein A 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.472757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.158248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCACA AAATTTGGAC TGGTCTTATG ACTTTCCTAA TGTTTTCTCA TGTAGGTGCA 
ACTTACTGTT ATACCTATGT TCTCCGGGAA GGCAAAACTG ATCAGCCGAA CGAACCAATA
GATAGAAATT TAGAAACACA TGATAATTTA CCAAAAACCT TTGCTAACAG CCAAACAGAT
ATACTAAAAG TTAGAAAACT TAGGTTTAAC ACAAAAAAAG AGCCCAATAG CATTACTAAA
ATTAAAGCTC ATGAACTGCT TGATCGCCAA GCGGCAACTC TCTATGTTCG CAATATTCCA
GTTCTGACTT TTCTTGACTC TGATAAAACT AATAAGAAGG CTGATCATCA AGGTTCAAAC
TCTGCGGCCA AAGCAGCTAC TATTCAATGT TGCAATTCAA ACAAACAAAA AGATCCCCTG
TCGGAAGCAT TAGTAGTTGC GTCCCGACTA GACCAATTTA TTCAGGAAAG TCCCAATTTA
AAGATTACAG CAAAGTGGGA TGCAGATACC AAAGGCTACA TAATTCTGGC CAATGACCAA
AAATTAGTAG AAATAAATGA GCAGACTATC CTACCAGATA CTACCAACGA CTTAGCAATA
GATACTTTAC AAGCAACGAA CCGCCTACGA CGACAAGTTA ATAATGCTCC TCCTTTAAAA
GAAATAGGAG GTAAACCACA ACTATTGCCA CAACCGAAAG CAAAAAAAGT TGCTCCTAAA
CCAATTCAGT TCCAATTTCA AGGTTGGGCA TCTTGGTATG GCCCTGGATT TCATGGTCGC
TTAAGCGCTA ATGGCGAGCG TTTTAATCAA TACGCAATGA CTGCTGCTCA TAAAACCCTA
CCTTTTGGAA CAAAAGTACG AGTGACTAAT TTACATAATG GTAGTTCAGT TATTGTGCGA
ATTAATGACC GTGGTCCATT TATTCCAGGT AGAGTTATTG ATTTATCCTC TGCTGCAGCA
GATGTTTTAG ATATGATTCA AATTGGAGTT GCTCCTGTCA AAGTAGAGGT TATGGCTCGT
GAATCATAA
 
Protein sequence
MKHKIWTGLM TFLMFSHVGA TYCYTYVLRE GKTDQPNEPI DRNLETHDNL PKTFANSQTD 
ILKVRKLRFN TKKEPNSITK IKAHELLDRQ AATLYVRNIP VLTFLDSDKT NKKADHQGSN
SAAKAATIQC CNSNKQKDPL SEALVVASRL DQFIQESPNL KITAKWDADT KGYIILANDQ
KLVEINEQTI LPDTTNDLAI DTLQATNRLR RQVNNAPPLK EIGGKPQLLP QPKAKKVAPK
PIQFQFQGWA SWYGPGFHGR LSANGERFNQ YAMTAAHKTL PFGTKVRVTN LHNGSSVIVR
INDRGPFIPG RVIDLSSAAA DVLDMIQIGV APVKVEVMAR ES