Gene Tery_4669 details


Gene Information

Locus tag: Tery_4669
Symbol: psaA
ID: 4246323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7178626
End bp: 7180887
Gene Length: 2262 bp
Protein Length: 753 aa
Translation table: 11
GC content: 44%
IMG OID: 638109534
Product: photosystem I P700 chlorophyll a apoprotein A1
Protein accession: YP_724110
Protein GI: 113478049
COG category:
COG ID:
TIGRFAM ID: [TIGR01335] photosystem I core protein PsaA
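
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 7178626, 7180887
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints: 2262 753
```

Under these assumptions the computed values agree with the Gene Length (2262 bp) and Protein Length (753 aa) fields.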


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.33052
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.086374
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATAA GTCCTCCAGA GCGTGAGGCA AAAGTAAGAG TAACTGTTGA TACTGACCCT 
GTACCTACTT CTTTTGAAAA GTGGGGGAAA CCTGGTCACT TCAGTCGGTC TCTAGCCAGA
GGACCAAAAA CCACTACCTG GATTTGGAAC CTCCATGCTG ACGCTCACGA CTTTGATAGT
CACACTTCTG ATTTAGAAGA TGTATCGCGC AAAATCTTCA GCGCACACTT TGGTCACCTG
GCCATAATCT TTGTCTGGTT GAGTGGCGCA TATTTTCATG GCGCTAAATT TTCCAACTAT
GAAGCTTGGT TAGCAGACCC TACAGGAATT AAACCAAGTG CTCAAGTTGT ATGGCCAGTT
TTAGGCCAGG GCATCTTAAA TGGTGATATG GGTGGTGGTT TCCATGGCAT CCAAATTACT
TCCGGATTAT TCCAATTATG GCGCGCTTCT GGCATCACTA ATGAGTACCA ACTATACTGC
ACGGCTATCG GTGGCTTAGT AATGGCAGCC CTGATGATGT TTGCAGGTTG GTTCCATTAT
CATAAGAAAG CACCGAAATT GGAATGGTTC CAGAATGTGG AATCAATGAT GAACCATCAC
TTAGCAGGCT TACTAGGTTT AGGTTGCCTG TCTTGGGCAG GCCACCAGAT TCATGTTTCT
TTGCCTATTA ATAAGTTATT AGATTCTGGG GTAGCACCAG AGGATATTCC TCTACCTCAT
GAATTTCTAT TTAATAAAAG CTTGATGGCA GAGTTGTACC CTAGTTTTGC CAAAGGTTTA
ACCCCATTCT TTACATTGAA TTGGGGTGAA TATGCTGATT TCCTGACCTT TAAAGGTGGT
TTAAACCCTG TAACAGGTGG CTTGTGGTTA TCTGATACAG CTCATCATCA TTTAGCACTA
GCAGTATTAT TCATAGTTGC TGGCCATATG TACCGGACAA ATTGGGGCAT TGGTCACAGC
ATGAAAGAAA TTTTAGAAGC TCATAAAGAT CCACTCATAA TTGGTGGTGA AGGTCATAAA
GGTCTATATG AAGCTTTAAC AACTTCCTGG CACGCCCAGT TAGCAATTAA CTTGGCTATG
CTAGGATCTC TTAGCATCAT TGTGGCACAT CACATGTATG CAATGCCACC GTATCCCTAT
ATTGCGACTG ACTACCCAAC TCAGTTATCT CTGTTCACTC ACCATATGTG GATTGGAGGA
TTCCTGATTG TGGGAGCTGG AGCTCATGGT GCTATTTTTA TGGTGCGGGA TTATGATCCA
GCAAAGAATG TTAACAACGT GTTGGATCGA GTCATTCGTC ACCGGGATGC AATTATCTCT
CACCTGAACT GGGTTTGTAT TTTCCTGGGC TTCCATAGCT TCGGATTATA CGTTCATAAT
GATACTCTGC GGGCTTTAGG TCGTCCCCAG GATATGTTCT CTGATACTGG TATTCAACTA
CAGCCAATCT TTGCTCAGTG GGTACAAAAT CTTCATAGTT TAGCTCCTGG CAATACTGCT
CCTAATGCTT TAGCATCCGT CAGCCCAATT TTTGGTGGCG ATGTATTAGC AGTAGGTGGC
AAAGTGGCAA TGATGCCAAT GACCTTGGGT ACGGCGGACT TTATGGTGCA CCATATCCAT
GCCTTTACAA TCCATGTTAC AGCCCTGATT CTTCTTAAAG GTGTACTGTA TGCTCGTAAC
TCTCGACTAA TTCCAGATAA GAGTGAACTA GGCTTCCGAT TCCCTTGTGA TGGTCCTGGT
CGTGGTGGTA CTTGTCAAGT TTCTGGTTGG GACCATGTAT TCTTGGGTCT TTTCTGGATG
TATAATTCAC TGTCAATTGT GATTTTTCAC TTTAGTTGGA AGATGCAGTC AGATGTCTGG
GGAACAGTAG ATCCAGATGG TACAGTGAGT CACATTACTT ACGGCAACTT TGCCCAAAGT
GCAGTTACTA TCAATGGTTG GTTGCGTGAC TTCTTGTGGG CACAAGCTTC AAACGTAATT
ACATCATACG GTTCAGAATT ATCTGCATAC GGTTTGTTAT TCCTTGGCGC ACACTTTATT
TGGGCATTTA GCTTAATGTT CTTGTTTAGT GGTCGTGGGT ACTGGCAAGA GTTGATAGAG
TCTATAGTTT GGGCTCATAA TAAGTTAAAG GTTGCTCCAG CAATTCAGCC TCGTGCTTTA
AGCATAACCC AGGGTAGAGC AGTAGGAGTA GCTCATTATC TCCTCGGCGG AATTGTCACA
ACCTGGGCAT TCTTCCTAGC TCGAATAATT GCAGTAGGAT AG
 
Protein sequence
MTISPPEREA KVRVTVDTDP VPTSFEKWGK PGHFSRSLAR GPKTTTWIWN LHADAHDFDS 
HTSDLEDVSR KIFSAHFGHL AIIFVWLSGA YFHGAKFSNY EAWLADPTGI KPSAQVVWPV
LGQGILNGDM GGGFHGIQIT SGLFQLWRAS GITNEYQLYC TAIGGLVMAA LMMFAGWFHY
HKKAPKLEWF QNVESMMNHH LAGLLGLGCL SWAGHQIHVS LPINKLLDSG VAPEDIPLPH
EFLFNKSLMA ELYPSFAKGL TPFFTLNWGE YADFLTFKGG LNPVTGGLWL SDTAHHHLAL
AVLFIVAGHM YRTNWGIGHS MKEILEAHKD PLIIGGEGHK GLYEALTTSW HAQLAINLAM
LGSLSIIVAH HMYAMPPYPY IATDYPTQLS LFTHHMWIGG FLIVGAGAHG AIFMVRDYDP
AKNVNNVLDR VIRHRDAIIS HLNWVCIFLG FHSFGLYVHN DTLRALGRPQ DMFSDTGIQL
QPIFAQWVQN LHSLAPGNTA PNALASVSPI FGGDVLAVGG KVAMMPMTLG TADFMVHHIH
AFTIHVTALI LLKGVLYARN SRLIPDKSEL GFRFPCDGPG RGGTCQVSGW DHVFLGLFWM
YNSLSIVIFH FSWKMQSDVW GTVDPDGTVS HITYGNFAQS AVTINGWLRD FLWAQASNVI
TSYGSELSAY GLLFLGAHFI WAFSLMFLFS GRGYWQELIE SIVWAHNKLK VAPAIQPRAL
SITQGRAVGV AHYLLGGIVT TWAFFLARII AVG
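
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC percentage calculation, assuming plain A/C/G/T input; `clean_sequence` and `gc_percent` are hypothetical helper names, and the way the database itself rounds its GC content field is not known from this record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block and uppercase it."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGACCATAA GTCCTCCAGA GCGTGAGGCA AAAGTAAGAG TAACTGTTGA TACTGACCCT"
print(len(clean_sequence(first_line)), round(gc_percent(first_line), 1))
```

Passing the full gene sequence block to `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.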