Gene Cyan8802_2075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2075 
Symbol 
ID8391391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2088881 
End bp2089891 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content43% 
IMG OID644980053 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_003137798 
Protein GI257059910 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00113563 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAAC AACGAGTCTT ATCTGGAGTA CAACCCACGG GAAACCTGCA TCTAGGTAAC 
TATTTAGGGG CAATTCGCAA CTGGGTAGAG ATTCAGTCAA ATTACGAGAA TTTCTTTTGT
GTGGTGGACT TACACGCCAT TACCGTCCCC CATAACCCGA AAACCTTAGC GCAAGATACC
TATACCATCG CTGCCCTGTA TTTAGCCTGT GGCATCGATC TTAACCACTC TACCATCTTT
GTTCAGTCCC ACGTCAGTGC CCATAGCGAA CTCGCCTGGT TACTCAACTG TCTTACTCCC
CTCAATTGGC TAGAGAGGAT GATACAGTTC AAAGAAAAAG CCCTAAAACA AGGGGAAAAC
GTCAGCGTTG GCTTATTAGA CTATCCCGTG TTGATGGCAG CAGATATCCT TCTGTATGAT
GCTGATCGTG TACCCGTTGG GGAAGATCAA AAACAGCATT TAGAATTAAC TAGAGATATC
GTTATTCGCT TTAATGACCA ATTTGCTACC CCCGAAAATC CCGTCTTGAA AATGCCTGAA
CCCCTGATTC GGACTGAAGG GGCAAGGGTG ATGAGTTTAA CCGATGGAAC CCGCAAAATG
TCAAAATCCG ATCCCTCGGA GATGAGTCGG ATTAATCTGT TAGATCCGCC CGAATTAATT
CAAAAAAAGA TTAAACGTTG CAAAACCGAT CCCATTGTTG GATTAGAATT TGATAATCCA
GAACGACCTG AATGTAACAA TTTATTGGGA CTGTATGGCT TATTATCCCA AAAGACGAAA
CAAGAAGTCA TTACGGAATG TCAAGACATG GGATGGGGAA AATTTAAACC CCTACTAACG
GAAACCACCA TCGAAGCCCT TAAACCCATT CAACTAAAAT ATCAAGAAAT CATGGATAAT
AAGGATTATT TAGATTCGGT TTTGCGAGAG GGCAAAGAAA AAGCAGAAAC CGTCGCCAAT
CAAACTTTAA CCCGGGTCAA AGAAGCGTTA GGTTATTTAG CCCCCCTTTA G
 
Protein sequence
MGKQRVLSGV QPTGNLHLGN YLGAIRNWVE IQSNYENFFC VVDLHAITVP HNPKTLAQDT 
YTIAALYLAC GIDLNHSTIF VQSHVSAHSE LAWLLNCLTP LNWLERMIQF KEKALKQGEN
VSVGLLDYPV LMAADILLYD ADRVPVGEDQ KQHLELTRDI VIRFNDQFAT PENPVLKMPE
PLIRTEGARV MSLTDGTRKM SKSDPSEMSR INLLDPPELI QKKIKRCKTD PIVGLEFDNP
ERPECNNLLG LYGLLSQKTK QEVITECQDM GWGKFKPLLT ETTIEALKPI QLKYQEIMDN
KDYLDSVLRE GKEKAETVAN QTLTRVKEAL GYLAPL