Gene Tery_1230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1230 
Symbol 
ID4242157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1904238 
End bp1905296 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content43% 
IMG OID638106443 
Productphotosystem II D2 protein (photosystem q(a) protein) 
Protein accessionYP_721054 
Protein GI113474993 
COG category 
COG ID 
TIGRFAM ID[TIGR01152] Photosystem II, DII subunit (also called Q(A)) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000683761 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGTAG CAGTCGGACG CCCACAATCT CAAAGAGGAG CATTTGATGT CCTCGATGAC 
TGGCTAAAAC GCGATCGCTT TGTATTCGTT GGTTGGTCTG GTATACTTCT CTTCCCATGT
GCTTTCTTGT CCATTGGAGG ATGGTTAACT GGGACTACCT TCGTCACTTC CTGGTATACC
CACGGGTTAG CCAGTTCTTA CCTAGAAGGA TGTAATTTTC TAACAGTTGC AATTAGCAGC
CCAGCTTATA GCATGGGACA TTCCCTACTC TTCCTTTGGG GCCCAGAGGC TCAGTGGGAC
TTTGCCCGTT GGTGTCAAAT CGGTGGTCTC TGGTCATTCA CTGCCCTACA CGGTGCATTT
GCTCTAATTG GATTCTGCCT ACGTCAGATT GAAATTGCTC GTTTGGTTGG AATACGTCCT
TACAATGCTA TTGCCTTCAC AGGACCAATT GCAGTATTTG TAAGTGTATT TTTAATGTAC
CCATTGGGTC AATCAAGCTG GTTCTTTGCA CCTAGTTTTG GTGTAGCTGG TATATTCAGA
TTTATCCTAT TTTTACAAGG ATTCCATAAC TGGACACTCA ACCCATTTCA TATGATGGGT
GTAGCAGGAA TCTTGGGTGG TGCACTTTTG TGTGCTATTC ATGGTGCTAC AGTAGAAAAC
ACACTTTTCC AAGATGGAGA GGCAGCAAAT ACCTTCCGCG CATTTGAGCC TACTCAATCT
GAAGAAACAT ATTCAATGGT GACAGCTAAT CGTTTCTGGT CCCAAATTTT CGGTATTGCA
TTTTCCAACA AACGTTGGTT ACACTTCTTC ATGTTATTTG TACCAGTAAC AGGATTGTGG
ATGAGTTCCA TAGGTATAGT GGGTTTAGCA TTCAACCTAC GAGCTTATGA CTTTGTATCT
CAAGAGTTAC GGGCAGCAGA TGACCCAGAA TTTGAAACAT TTTATACCAA AAATATCTTG
CTAAATGAAG GTATCCGAGC TTGGATGTCT CCTGCAGACC AACCACATCA AAACTTTATG
TTCCCAGAGG AAGTACTACC TCGTGGTAAC GCTCTTTAA
 
Protein sequence
MTVAVGRPQS QRGAFDVLDD WLKRDRFVFV GWSGILLFPC AFLSIGGWLT GTTFVTSWYT 
HGLASSYLEG CNFLTVAISS PAYSMGHSLL FLWGPEAQWD FARWCQIGGL WSFTALHGAF
ALIGFCLRQI EIARLVGIRP YNAIAFTGPI AVFVSVFLMY PLGQSSWFFA PSFGVAGIFR
FILFLQGFHN WTLNPFHMMG VAGILGGALL CAIHGATVEN TLFQDGEAAN TFRAFEPTQS
EETYSMVTAN RFWSQIFGIA FSNKRWLHFF MLFVPVTGLW MSSIGIVGLA FNLRAYDFVS
QELRAADDPE FETFYTKNIL LNEGIRAWMS PADQPHQNFM FPEEVLPRGN AL