Gene PCC8801_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1901 
Symbol 
ID7102857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1977373 
End bp1978488 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content41% 
IMG OID643474962 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002372095 
Protein GI218246724 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG1173] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTGGT GGCAAAAGCT TAAAACTAAC TCCCTCGCGC GTTTTGGGGC AATTCTATTA 
ATAATCTTCT ATAGTTTAGT CATAGCAGCC GATTTTGTTG CCCCCTATAA TCCCTATTCT
TCGCAAATTG ACGGATCATT ATTACCTCCG ACTCCCATCT ATTGGACAAC CCCAGACGGT
GAGTTTATTG GACTTCATGT TTATCCCACC ACCCAAAGTC CCACCGATCT TGAAACGGGA
AAACGCACCT TAAATATTGA TTTTAAGCAA CCCACTCCGA TCCGTTTATT TGTTAAAGGC
GATCGCTATC AGCTATTCCA AATTCGTCTC CCCTTACCCC CCACCTTCAA AGAAGTAGAA
ATTTTCCCTG GTATTCCTTT TGATCGTCAT TTATTCGGAA CGGTTGGGCA AGCAAAATTA
AATCTTTTGG GAACCGATGA ACAAGGACGA GATCAATTTA GTCGGCTGAT ATTTGGGGGG
AGAATCAGCC TATTTATTGG CTTAGTTGGT ATTATTATTT CTTTTCCTTT GGGTATGATT
GTTGGTGGTA TTTCTGGCTA TTTTGGAGGG TGGTTAGATG CAGGATTAAT GCGTTTAGTT
GAAGTTTTAA TGACCATTCC AGGGCTTTAT TTATTAGTAG CTTTAGCGGC AGTTTTACCC
CCTAGTTTAA GCAGTACTCA ACGGTTTTTA TTGATTGTTT TAATTACGTC TTTTATTAGT
TGGTCAGGAC TAGCGCGGGT TATTCGGGGA CAAGTTCTCT CTCTTAAAGA ACAGGAATTT
GTTCAAGCAG CAAGGGCAAT GGGGGCAACC CCTTGGCGGA TTATTATTCA GCATATTTTG
CCTCAAACGG CTACTTATAT CATTATTTCT GCTACTTTAG CGGTTCCAGG GTTTATTGTC
GCTGAGTCTG TCTTAAGTTT AATTGGGTTA GGCATTCAAC AACCCGATCC CAGTTGGGGA
AATTTACTTT CTATTGCAAC TAATGCGTCA ATTTTAGTCT TACAACCTTG GTTAATTTGG
CCACCAGCTT TGCTGATCGT TCTCACTGTC TTAGCCTTTA ATTTACTCGG AGATGGACTG
CGCGATGCCC TTGATCCTCG TTCTCTGAAT AATTGA
 
Protein sequence
MNWWQKLKTN SLARFGAILL IIFYSLVIAA DFVAPYNPYS SQIDGSLLPP TPIYWTTPDG 
EFIGLHVYPT TQSPTDLETG KRTLNIDFKQ PTPIRLFVKG DRYQLFQIRL PLPPTFKEVE
IFPGIPFDRH LFGTVGQAKL NLLGTDEQGR DQFSRLIFGG RISLFIGLVG IIISFPLGMI
VGGISGYFGG WLDAGLMRLV EVLMTIPGLY LLVALAAVLP PSLSSTQRFL LIVLITSFIS
WSGLARVIRG QVLSLKEQEF VQAARAMGAT PWRIIIQHIL PQTATYIIIS ATLAVPGFIV
AESVLSLIGL GIQQPDPSWG NLLSIATNAS ILVLQPWLIW PPALLIVLTV LAFNLLGDGL
RDALDPRSLN N