Gene Tery_4666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4666 
Symbol 
ID4246320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7173887 
End bp7175413 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content43% 
IMG OID638109531 
Productphotosystem antenna protein-like 
Protein accessionYP_724107 
Protein GI113478046 
COG category 
COG ID 
TIGRFAM ID[TIGR03039] photosystem II chlorophyll-binding protein CP47 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0622483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0120025 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTAC CTTGGTATCG TGTACATACA GTTGTTCTGA ATGATCCAGG ACGACTAATA 
TCCGTGCATT TAATGCACAC TGCTCTTGTA GCAGGTTGGG CCGGATCAAT GGCCCTTTAT
GAACTGGCAA TCTTTGATCC TAGCGATCCA GTGCTAAACC CTATGTGGCG TCAGGGAATG
TTTGTTATGC CATTCATGGC CCGTCTAGGT GTTACCCAAT CTTGGGGTGG TTGGAGTGTT
ACAGGAGAAG TAGCTTCAAA CCCTGGTTTC TGGTCTTTTG AAGGTGTAGC AGCAGCTCAT
ATCGTTCTTT CTGGTTTGCT ATTCCTAGCT GCTGTATGGC ATTGGGTTTT CTGGGATTTA
GATTTATTCT TTGACCAACG TACAGGAGAT CCTGCTCTAG ACCTGCCAAA AATGTTTGGC
ATTCACTTAT TCTTATCTGG TTTACTTTGC TTTGGCTTTG GGGCCTTCCA TTTGACCGGA
GTATTTGGTC CAGGAATGTG GGTATCAGAT CCTTATGGGT TAACAGGCTC GATCCAGCCA
GTAGCACCAT CATGGGGACC AGAAGGATTT AACCCATTTA ATGCTGGTGG AATTGTTGCT
CACCACATTG CTGCTGGAAT TGTGGGAATT ATTGCTGGTT TATTCCACTT ATCAGTAAGA
CCGCCAGAAC GTCTCTACAG AGCCTTGCGG ATGGGTAATA TTGAAACAGT TCTTTCCAGT
AGTATTGCAG CAGTTTTCTT TGCAGCTTTT GTTGTAGCTG GTACAATGTG GTATGGCTGT
GCAGCTACTC CTATAGAATT ATTTGGCCCT ACTCGTTATC AGTGGGATAA TAGTTACTTT
CAGCAAGAAA TTGATCGTCG TGTTCAAGCT GAAATAGTAG CAGGAAAAAC TCCCTCTGAA
GCTTGGTTAA CTATCCCCGA AAAATTAGCC TTTTATGACT ACGTTGGTAA TAGTCCAGCA
AAAGGCGGTC TTTTCCGTGT TGGTCCTATG AATAATGGAG ATGGTTTAGC TCAAGCTTGG
CTTGGTCATC CTATATTTGA GGATAGGGAT GGTGAAGAAT TATTTGTACG TCGCTTACCT
AACTTTTTTG AAACTTTCCC AGTAGTTTTG ACCAATTCTG AAGGTGTAGT AAAGGCTGAT
ATTCCTTTTC GTCGTTCAGA GTCAAAATAT AGTTTTGAGG AACGGGGGGT AACAGTTAGC
TTCTTGGGTG GTGAACTTGA TAGTCAAACC TTTACTAACC CAGCCGATGT TAAAAAGTAT
GCTCGTAAAG CCCAAACTGG TGAACCTTTT GAATTTGACC AAGAAACTTT AGGCTCTGAT
GGTGTGTTCC GAACTAGTAC TCGTGGTTGG TTTACTTTTG GTCATGCTTG TTTTGCTCTC
CTATTCTTCT TCGGTCATAT TTGGCATGGA TGCCGGACCT TATTCCGCGA TGTGTTTGCA
GGTATTGACC CAGACCTAGA GGAACAAGTA GAGTTTGGTT TATTCCAAAA ACTGGGTGAC
TTGAGTACCC GTAGGAAGGA AACATAA
 
Protein sequence
MGLPWYRVHT VVLNDPGRLI SVHLMHTALV AGWAGSMALY ELAIFDPSDP VLNPMWRQGM 
FVMPFMARLG VTQSWGGWSV TGEVASNPGF WSFEGVAAAH IVLSGLLFLA AVWHWVFWDL
DLFFDQRTGD PALDLPKMFG IHLFLSGLLC FGFGAFHLTG VFGPGMWVSD PYGLTGSIQP
VAPSWGPEGF NPFNAGGIVA HHIAAGIVGI IAGLFHLSVR PPERLYRALR MGNIETVLSS
SIAAVFFAAF VVAGTMWYGC AATPIELFGP TRYQWDNSYF QQEIDRRVQA EIVAGKTPSE
AWLTIPEKLA FYDYVGNSPA KGGLFRVGPM NNGDGLAQAW LGHPIFEDRD GEELFVRRLP
NFFETFPVVL TNSEGVVKAD IPFRRSESKY SFEERGVTVS FLGGELDSQT FTNPADVKKY
ARKAQTGEPF EFDQETLGSD GVFRTSTRGW FTFGHACFAL LFFFGHIWHG CRTLFRDVFA
GIDPDLEEQV EFGLFQKLGD LSTRRKET