Gene PCC8801_4043 details

Gene Information

Locus tag: PCC8801_4043
Symbol:
ID: 7104617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4236093
End bp: 4237274
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 48%
IMG OID: 643477036
Product: aminotransferase class V
Protein accession: YP_002374136
Protein GI: 218248765
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID:
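
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4236093
END_BP = 4237274
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 1182 bp span corresponds to 394 codons, of which 393 encode amino acids.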


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAATTT ATCTGGATTA TAGTGCGACA ACTCCCCCTC GTTCAGAAGC GATCACCCAA 
GTGGAGGCGA TCCTCAAACA ACAGTGGGGT AACCCGTCCA GTTTGCATAA CTGGGGACAA
CAAGCTGCTA CGATCCTCGA AACCGCGCGT TGGCAAGTGG CTAATTTGAT TAATGCGCCC
TCAGCAGATT CTATCATTTT TACCTCCGGG GGAACCGAAG CCGATAATCA TGCCTTGCTA
GGCATAGCCA GATCCTACAG TAAGCCACAA CATTTGATTA TTTCCTCGGT GGAACATTCG
GCAATTTCCG AAACTGCTCA AATCCTAGCA CAATCGGGGT GGCAAGTCAC GATTTTACCG
GTTAACCGTC AGGGAAGAGT GACTCCACTG GAATTGAAAG CAGCTATTCG ACCCAATACG
TCCCTAATTT CGATTATTTA TGGCCAAAGC GAAATCGGAA CCATTCAACC CATTGAGGAA
TTGGCGAAAA TTGCCCAAGC AGAAGGGGTG CTTTTTCATA CTGATGCGGT ACAGGTAGCC
GGAAGATTAC CCCTCGATGT CCAACGGTTG GGGGTGGATT TATTGTCGCT TTCGGCACAC
AAAATCTATG GGGTTCAAGG GGCCGGGGCG TTATATGTGC GTCCAGGGGT AGAAATTGCC
CCTTTGTTGG CAGGAGGAGG GCAAGAACGA CGGTTACGGT CAGGAACCCA AGCTGTCCCG
GCAATCGCAG CCTTTGGGAT CGCGGCTGAA TGGGCGGCCA CAGAAATAGC CACGGAAACC
CCTCGGTTAC GCGGACTGCG CGATCGCCTT TTCGATTTGA TGGCCGATTG TCCCTATCTT
ATTCCCACGG GGGATAGATT GTATCGCCTT CCCCATCATG TGAGTTTTAT TGTGACTGAC
CCTTTTAATC AAAAAGTCTC CGAAAGGATT ACGGGTAAAA CGATTGTTCG TCAGCTTAAT
TTAGCCGGAA TTGGGATTAG TGCTGGTTCA GCGTGTCATA GTGGTAAATT GAGTCCCAGT
CCGATTTTAT TGGCGATGGG TTATTCTGAA AACGAAGCGT TAGGGGGTAT TCGTTTAACT
CTCGGACGGG AAACAACTTT AGAAGATATT GAATGGACGG CTATGGTTCT TAAGCAAGTT
TTAGGGCGTT TAATGCCACA ATTGGAATGT GTTGGGTGTT AA
 
Protein sequence
MQIYLDYSAT TPPRSEAITQ VEAILKQQWG NPSSLHNWGQ QAATILETAR WQVANLINAP 
SADSIIFTSG GTEADNHALL GIARSYSKPQ HLIISSVEHS AISETAQILA QSGWQVTILP
VNRQGRVTPL ELKAAIRPNT SLISIIYGQS EIGTIQPIEE LAKIAQAEGV LFHTDAVQVA
GRLPLDVQRL GVDLLSLSAH KIYGVQGAGA LYVRPGVEIA PLLAGGGQER RLRSGTQAVP
AIAAFGIAAE WAATEIATET PRLRGLRDRL FDLMADCPYL IPTGDRLYRL PHHVSFIVTD
PFNQKVSERI TGKTIVRQLN LAGIGISAGS ACHSGKLSPS PILLAMGYSE NEALGGIRLT
LGRETTLEDI EWTAMVLKQV LGRLMPQLEC VGC
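
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that grouping and translates DNA with the amino-acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). The helper names `clean` and `translate` are hypothetical. Only the first and last lines of the gene sequence are embedded here to keep the example short; both lines start on a codon boundary, since each full line holds 60 bases.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "ATGCAAATTT ATCTGGATTA TAGTGCGACA ACTCCCCCTC GTTCAGAAGC GATCACCCAA"
LAST_LINE = "TTAGGGCGTT TAATGCCACA ATTGGAATGT GTTGGGTGTT AA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MQIYLDYSATTPPRSEAITQ
    print(translate(LAST_LINE))   # LGRLMPQLECVGC
```

The two outputs match the first 20 and last 13 residues of the protein sequence listed above. Passing the full gene sequence to `translate` follows the same pattern.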