Gene Cyan8802_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_1033 
Symbol 
ID8390342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp1057158 
End bp1058408 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content37% 
IMG OID644979048 
Productextracellular ligand-binding receptor 
Protein accessionYP_003136801 
Protein GI257058913 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.344987 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000487812 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGATTA AGCTGTTAAG CCAATTTTTA GTTATTTTAG TTGTTCTAGT CTCCTTATTC 
GTGAACAACC TAGTGTTGGC TAGTGAACAA TATAAAGACC CCATTGTCAT TGGAATGTCT
GCCGCTTTCA CGGGAGCATC AAAAAATCTT GGCTTAGAAT TATATCATGG CTCAATGGCT
TATATCAACA AGATTAATCA GTCGGGAGGC ATTAATGGTC ATCCCCTTGT AATTAAAGCT
TATGATGATG GATATAATCC TTTACCCGCG ATTGAAAATA CGGTGAATCT GGTGGAAGAA
GATGAAGTCA CTGTATTATT TGATTATGTA GGAGATCCCA CCGTTACTAA AATTTTACCA
CTGTTAAAAA AATACGAAGC TAAAAATATC ATGCTATTTT TCCCCTTTAC AGGAGCCCAA
TCCATGAGGC AAGTGCCTTA TAATCAATAT GTGGTTAATC TGAGGGCATC TTATCGGGAA
GAAACCGCCG GATTAGTAGA TCATTTATTA GGGATTGGCC ACAAGCGTAT AGCTGTATTT
TATCAAATTG ATGCCTATGG TCGCAGTGGT TGGGATGGCG TACGCAAGGC ATTAGAAAAG
TATGGACTAG ATATTGTTGC TGAAACGACC TATCGTCGAG GAACTGAATA TAATAGTAGT
TTTAACCCTC AAGTTAAGAT TTTACAAGAG GCCGATCCCG ATGCTATTAT TTCTATTGGT
AACTATCAAG CTTGTGCTGG ATTTATTCGA GATGCAAGAG ATGCAGATTG GGATATTCCT
ATTGCTAATG TTTCCTTGGT GGGGAGCGAA AGTTTATTAA AATTATTATT AGAAACAGGT
CGTAAAACCC AGAGAAACTA TACTCAGAAT TTAATTAATT CCGAGATTCT TCCTAGTTAT
GAGGATCTTT CCCTCCCTGC TGTTAAAGAA TATCGTAATG CCATCAATAG CTATCGTGGA
AAATCACCGA TCACGAAAGA GAATTATACT GAGTCAGGTT ATAATTATGT GAGCTTTGAA
GGGTTTCTAA ATGCTAAATT AATGGTAGAG ATTTTAAAGC GTTGGACAGA TTTTTCTGAT
CAAGATCAGC TTCATGAAAT TGTCGATCAT CTCAACGATT TTGATCTCGG CATTGGGGTT
TCACTACAGT TTAAACATCC TGAACATCAA GGACTACACC AAGTCTATTA TACTACCGTT
TCTAATAATA AATTTGTCCC CCTTAAAGAT TGGAGAAAAT GGTCAAAATG A
 
Protein sequence
MAIKLLSQFL VILVVLVSLF VNNLVLASEQ YKDPIVIGMS AAFTGASKNL GLELYHGSMA 
YINKINQSGG INGHPLVIKA YDDGYNPLPA IENTVNLVEE DEVTVLFDYV GDPTVTKILP
LLKKYEAKNI MLFFPFTGAQ SMRQVPYNQY VVNLRASYRE ETAGLVDHLL GIGHKRIAVF
YQIDAYGRSG WDGVRKALEK YGLDIVAETT YRRGTEYNSS FNPQVKILQE ADPDAIISIG
NYQACAGFIR DARDADWDIP IANVSLVGSE SLLKLLLETG RKTQRNYTQN LINSEILPSY
EDLSLPAVKE YRNAINSYRG KSPITKENYT ESGYNYVSFE GFLNAKLMVE ILKRWTDFSD
QDQLHEIVDH LNDFDLGIGV SLQFKHPEHQ GLHQVYYTTV SNNKFVPLKD WRKWSK