Gene Tery_4668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4668 
SymbolpsaB 
ID4246322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7176127 
End bp7178343 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content43% 
IMG OID638109533 
Productphotosystem I P700 chlorophyll a apoprotein A2 
Protein accessionYP_724109 
Protein GI113478048 
COG category 
COG ID 
TIGRFAM ID[TIGR01336] photosystem I core protein PsaB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.298458 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0126467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAA AATTCCCAAA ATTTAGCCAA GACCTGGCAC AAGATCCAAC CACCCGTCGG 
ATCTGGTATG GAATTGCTAC AGCCCATGAC TTTGAAACCC ACGATGGCAT GACAGAGGAA
AATCTTTATC AAAAGATTTT TGCTTCTCAC TTTGGCCATC TAGCAATCAT TTTCCTCTGG
ACTTCTGGTA GCCTCTTCCA CGTGGCTTGG CAAGGAAACT TTGAACAGTG GATTAAAGAT
CCTCTGAATG TACGTCCCAT CGCTCATGCG ATTTGGGATC CTCAATTTGG TCAAGAGGCC
GTAGATGCCT TCACCCAAGC TGGAGCCTCT AATCCAGTAA ATATAGCCTA TTCTGGTGTT
TACCATTGGT GGTACACTAT CGGCATGAGA ACAAATGGAG ACTTATATCA GGGTTCTATT
TTTCTCCTGA TATTAGCATC CATCATGCTG TTTGCTGGTT GGTTACACCT CCAGCCTAAG
TTTAGTCCTA GCCTAGCATG GTTCAAAAAT GCTGAGTCCC GCCTCAATCA CCATTTGGCA
GGTCTATTCG GTGTTAGTTC CTTAGCTTGG ACTGGTCACT TGGTTCACGT TGCAATTCCT
GAATCTCGTG GACAGCACGT TGGTTGGGAT AACTTCCTGA CGACTATGCC TCACCCAGCA
GGTTTAGGTC CCTTCTTTAC AGGAAATTGG GGCGTTTATG CTCAAAATCC TGATACCGTT
AACCATGTGT TTGGTACTTC TGAAGGTGCG GGAACTGCAA TTTTGACTTT CTTGGGTGGT
TTCCATCCTC AAACTGAGTC CCTGTGGTTG ACTGATATGG CTCATCACCA CTTGGCGATC
GCAGTTATCT TCATAATTGC TGGTCATATG TACCGCACTA ACTTTGGTAT TGGTCACAGT
ATCAAAGAAA TGCTTGATTC CAAAGCTGGT TTAATTGGTG GAAAAAGTCA AGGTCAGTTT
AACTTGCCTC ACCAAGGGTT ATATGAAACT CTAAATAACT CTCTACATTT CCAGTTAGCA
TTAGCTCTAG CTTCTCTTGG TGTAATTACT TCCTTGGTAG CACAGCACAT GTATGCTCTG
CCTCCTTATG CTTTTATGGC AAGGGATTAC ACTACAATGG CAGCACTTTA CACCCACCAT
CAGTATATTG CTGGTTTCAT TATGGTAGGT GCATTTGCTC ATGGTGCGAT TTTCCTAGTA
CGGGATTACG ATCCTGAGCA AAATAAAGGT AATGTTCTCG ATCGTGTCTT GAATCACAAA
GAGGCTATCA TATCTCACTT GAGTTGGGTT TCTTTATTCT TAGGATTCCA TACTCTAGGT
CTTTATGTCC ATAATGATGT AATGGTAGCT TTCGGCACTC CTGAAAAGCA AATCTTGATT
GAACCGGTAT TTGCTCAGTT CGTTCAGGCT GCTTCCGGGA AGGCTTTATA TGGCATGGAT
GTTCTGTTAT CTAACTCTGA TAGTCTTGCT TCCACTGCTG GTGCTGTATG GTTACCAAAT
TGGTTAGAAG CTATCAATAG TGGTACAAAT TCTCTGTTTT TAACCATTGG ACCCGGAGAT
TTCTTGGTTC ACCATGCCAT TGCTCTAGGT TTGCATACAA CAACTTTAAT TTTGGTTAAA
GGTGCATTGG ATGCTCGTGG TTCTAAGTTA ATGCCAGACA AGAAAGACTT CGGTTATGCA
TTCCCTTGTG ATGGCCCTGG TCGTGGTGGT ACTTGCGATA TCTCTGCTTG GGACTCCTTC
TATCTAGCTA TGTTCTGGAT GTTAAACACC ATTGGTTGGG TAACATTTTA CTGGCACTGG
AAAAACTTGT CTGTATGGCA AGAGAACTTG GCTCAGTTTA ATCAATCTTC TACTTATTTG
ATGGGTTGGT TGCGTGATTA TCTGTGGTTG AACTCTTCCC AGTTAATTAA CGGTTATAAC
CCTTATGGAA CTAGTAACTT GTCTGTTTGG GCTTGGATGT TCTTATTCGG ACACCTAGTT
TGGGCAACTG GTTTCATGTT CCTCATCTCT TGGCGTGGTT ACTGGCAAGA GCTAATCGAA
ACTATTGTTT GGGCACATGA GCGTACTCCT CTAGCTAACT TGGTTCGCTG GAAAGATAAG
CCTGTTGCTC TTTCTATTGT TCAAGCTCGT GTTGTTGGCT TAGCTCACTT TACTATTGGC
TATGTTCTGA CTTACGCCGC ATTTTTGATA GCTTCTACTG CTGGTAAGTT TGGTTAA
 
Protein sequence
MATKFPKFSQ DLAQDPTTRR IWYGIATAHD FETHDGMTEE NLYQKIFASH FGHLAIIFLW 
TSGSLFHVAW QGNFEQWIKD PLNVRPIAHA IWDPQFGQEA VDAFTQAGAS NPVNIAYSGV
YHWWYTIGMR TNGDLYQGSI FLLILASIML FAGWLHLQPK FSPSLAWFKN AESRLNHHLA
GLFGVSSLAW TGHLVHVAIP ESRGQHVGWD NFLTTMPHPA GLGPFFTGNW GVYAQNPDTV
NHVFGTSEGA GTAILTFLGG FHPQTESLWL TDMAHHHLAI AVIFIIAGHM YRTNFGIGHS
IKEMLDSKAG LIGGKSQGQF NLPHQGLYET LNNSLHFQLA LALASLGVIT SLVAQHMYAL
PPYAFMARDY TTMAALYTHH QYIAGFIMVG AFAHGAIFLV RDYDPEQNKG NVLDRVLNHK
EAIISHLSWV SLFLGFHTLG LYVHNDVMVA FGTPEKQILI EPVFAQFVQA ASGKALYGMD
VLLSNSDSLA STAGAVWLPN WLEAINSGTN SLFLTIGPGD FLVHHAIALG LHTTTLILVK
GALDARGSKL MPDKKDFGYA FPCDGPGRGG TCDISAWDSF YLAMFWMLNT IGWVTFYWHW
KNLSVWQENL AQFNQSSTYL MGWLRDYLWL NSSQLINGYN PYGTSNLSVW AWMFLFGHLV
WATGFMFLIS WRGYWQELIE TIVWAHERTP LANLVRWKDK PVALSIVQAR VVGLAHFTIG
YVLTYAAFLI ASTAGKFG