Gene PCC7424_4573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_4573 
Symbol 
ID7108323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp5058420 
End bp5059760 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content33% 
IMG OID643482790 
ProductTRAP transporter solute receptor, TAXI family 
Protein accessionYP_002379803 
Protein GI218441474 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT TAAGTTATAA AAAAATTACT TTACCTGTTT TGATTATTGT TATTACAGTT 
TTAATTGTTT TTACGGTCAA CTGGATTTTA AACCATCAAA AAGTTGATAC TTTAATTTTA
GCAACGGGGA ATGAAAAAGG ACAATATTAT GCCTTTGGAA AAGCGTTATC TAAGGTAGTA
AAAAAACATA ATTCCAAAAT TAATATTGAA GTTTTATCGA GTGAAGGATC AAAGCAAAAT
GTGGACTGGT TAAAGAAAGA AAAAGCTCAA TTAGCCATTG TTCAAAGTGA TACATTACTG
AGTCCGTCTA TTGAAGTGGT TAGTCTTTTA TTTCCTGAAG TTTTTCATTT ATTAGTGAGA
GAAGAAAAAG GGATTAAAAG TTTTAGTGAT TTAAAAGGAA AAAAAATTGC TTTAATGTCA
AAAAAAAGCG GTTCTTATGC TTTATTTGAA GTTTTAAGTC ATCATTACGG TCTAAAACCT
TCAGAATTTA TCCCTTTATC CATGTCCTTA GACGAAGCTA TTCGAGCCTT AGAAACGGGA
AAAGTGGATG CTATGTTTCA GGTCATAGCA TTAGGAAATC CTAATATAAC ACGACTCTTA
CAAAATCGAA ACATTGAGCT AGTTTCTATT GATCAAGGCG CAGCTTTACA ACTCTTAGTT
CCTGCCCTTG AAAATACTAT TATTCCCAAA GGAACGTATA ACGGAGCAAT TCCTATTCCG
GAAAATGACT TATCAACCGT TGGAGTCAGA GCCACCTTAG TAACCGATCG TCAAATTGAA
TCTAGTTTGA TTTATGAAAT TACTCGAATT TTATACGAAG CTCGTAATGA GTTAGTCAAA
GAAAATGTGC AAGCCGCTAT GATTTCTCTG CCCAATTCTA CCGATCAGAT CGGTTTTGCT
TTTCATCCGG GTGCTAAAAC TTACTATGAT CATGATCGAC CTAGTTTTAT TGTTGAGTAT
GCTGAACCAA TTAGCTTAGG AATGTCGGTG GTGGTACTTT GTTTTTCAGG ATTATGGCAA
TTACGAATAT GGATACAAGG GAAGAAAAAA AATCGGGCTG ATGTTTACAA TATCCAATTA
ATTGAGTTAA TTGAAAAAAT TAATCAGGCT CAAAATCTTA AAGAATTAAG AGAAATTCAA
GTTCAATTAT GGGAAATATT TGAAAAAGTG ATTATTGATT TAGACTACGA TCGGATTTCA
GCAGAATCCT TTCAATCTTT TACCTTTCCT TGGAATGTTG CCCTCAAGTC GATTCATCAT
CGAGAAACCC TTTTAAGCAC AATACAAGAT CATCAATTAT GGAAAGACAA TTCTGTAGCC
AAAAATCATC AAAATTTATA G
 
Protein sequence
MKKLSYKKIT LPVLIIVITV LIVFTVNWIL NHQKVDTLIL ATGNEKGQYY AFGKALSKVV 
KKHNSKINIE VLSSEGSKQN VDWLKKEKAQ LAIVQSDTLL SPSIEVVSLL FPEVFHLLVR
EEKGIKSFSD LKGKKIALMS KKSGSYALFE VLSHHYGLKP SEFIPLSMSL DEAIRALETG
KVDAMFQVIA LGNPNITRLL QNRNIELVSI DQGAALQLLV PALENTIIPK GTYNGAIPIP
ENDLSTVGVR ATLVTDRQIE SSLIYEITRI LYEARNELVK ENVQAAMISL PNSTDQIGFA
FHPGAKTYYD HDRPSFIVEY AEPISLGMSV VVLCFSGLWQ LRIWIQGKKK NRADVYNIQL
IELIEKINQA QNLKELREIQ VQLWEIFEKV IIDLDYDRIS AESFQSFTFP WNVALKSIHH
RETLLSTIQD HQLWKDNSVA KNHQNL