Gene PCC8801_4203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4203 
Symbol 
ID7104597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4408481 
End bp4410430 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content29% 
IMG OID643477188 
Producthypothetical protein 
Protein accessionYP_002374287 
Protein GI218248916 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGTAA TATTACCTTT AATTGCCCTG GTTTTGTTGT TGTTAATTTT CTATAAAAAA 
ACAAATTACT TTCGACATTC ATTGTTGTCA TCATCAATAT GTTGGGGGCT TTTATTGACG
ATAATTACTG AAGTTTTGAG CTTAATACAA TCACTTGTCT TTAGTTATCT TTCGTTAGGA
TGGGGATTCA TAAATATATT ATTGATTTTT ATTTACTTAA ATATAAAATT AACTAAACAT
AAGAAAAAAA TAGATCTAAA AAAAATATTT TATAAGCTTT CTAATTTTTG GATAATTATA
GTATTAGGAG TAACTTTTGT TGTTACCATC GTTGGTTTGA CAGCTTTAAT ATCACCGCCG
AATAATAGGG ATTCTATGGA TTATCATATG GCTCGTGTAG CCTATTGGAT TCAAAACCAT
AGCTTAAATC ACTATCCAAC TCATTACTTA GCTCAATTGT ACCAGGGACC CTTAGCAGAA
TTGATAATCG TGAATTTACA AATTTTGAGT GGTGGTGATT ATTTGGCTAA CCTTGTCCAG
TGGTTAAGTT TGATTGGGAG TCTTTTGGGG GTATCATTGA TTGCTAAATT ATTAGGTGCA
GATCTTTGGG GTCAACTATT TTCTGCGGTA GTTGCAGCTA CTATACCAAT GGGTATCCTT
CAAGGATCAA GCACTCAAAA TGATTATGTG GCTTCTTTTT GGTTAGTTTG CTTGACTTAT
TTTGTATTAT TAAATGTTAA AGGTCAGAGA AATTGGAGCA ATTTTTTACA GTTATCCTTT
AGTCTAGGAC TGGCTATTTT GACTAAAGGA ACATCCTATA TCTATGCTCT TCCTTTGATT
ATTTGGTTTT TACTTTCAGA ATTAAAGTAC CTCAAATGGC AGATATGGAA ACCAACTTTA
ATGTTAGGAA TTGTAGTTTT TATAATTAAT TTAGGACATT ATTCTCGTAA TTATTTATTG
TTTGGTAAAA TAATATTTAC TGGAAATAAA TATACAAATG ACGCTTTTTC ATTGCCAATT
TTTATATCAA ATTTCATTAG AAATATAGCA CTACATTTAC GAACTCCAGT TCAGTCATTT
AATTTTTTAC TACAAAAAGT TGTGTCTTTG ATTCATCAAA TATTAGGGAT AGATATTAGT
GATATTAGAA CTACATGGTC TGGACAAGAA TTTCAACTTT ATTTTCCTAG TGGTCAAGAA
GCTGTCAGTA GTTTATTACA TGAAAATTTA GCAGCTAATA CCCTACATTT ATTACTGATC
GTAGGAGCTA TTTTTATATT ATTTTTTCAA GGAAAATACA AAAAAGATAA ATTATTATTT
TATTACTGTA TAATTTTAAT TTCTATGTTT TTCCTGTTCT GTTTATTATT GAAATGGCAA
CCCTGGAATA GTCGTCTACA TTTGCCTATC TTTGTTTTGT TTTCACCGTT TAGTGGTAGT
GTTTTCTCCC GTTTTAAAAG TCGTTCATTT ATCACATTTC TATCAATATT TTTGATTATT
TCATCTTTTC CTTGGGTTTT TTTTAATACA TCTAGACCAA TAATTAGTAA TCTAAAAACG
GAAAGTATAT TAACAACAAG TAGAACAAAT CAGTATTTTA ATAATTGGCC AACAATTAAA
AACCCTTACT TAAAAGCTAC TGGATATATA AATTCTCAAG AATGTTCAAA AATTGGATTA
ACTCGAGATT TTCAAGTATG GCAGTATCCT TTATTTATAT TCATTAAACC GACAACATCT
AATCCATTAG AAATTAGAAA TATTAATGTT ACTAATATCT CGTCAGAAAA AATGAAAGTA
GAACCCTATA AACATTTTAT CCCTTGCGCT ATTATTTCTC TGAATTTTCC GGATCAAAAA
AAGAAAATAA GCAAAGAAAT TAACACTAAT ACTGGAACTT ACAGTATCAA ATTTTCCTCT
AGACTAATTA ATGTTTTTAT GAAACAATAG
 
Protein sequence
MAVILPLIAL VLLLLIFYKK TNYFRHSLLS SSICWGLLLT IITEVLSLIQ SLVFSYLSLG 
WGFINILLIF IYLNIKLTKH KKKIDLKKIF YKLSNFWIII VLGVTFVVTI VGLTALISPP
NNRDSMDYHM ARVAYWIQNH SLNHYPTHYL AQLYQGPLAE LIIVNLQILS GGDYLANLVQ
WLSLIGSLLG VSLIAKLLGA DLWGQLFSAV VAATIPMGIL QGSSTQNDYV ASFWLVCLTY
FVLLNVKGQR NWSNFLQLSF SLGLAILTKG TSYIYALPLI IWFLLSELKY LKWQIWKPTL
MLGIVVFIIN LGHYSRNYLL FGKIIFTGNK YTNDAFSLPI FISNFIRNIA LHLRTPVQSF
NFLLQKVVSL IHQILGIDIS DIRTTWSGQE FQLYFPSGQE AVSSLLHENL AANTLHLLLI
VGAIFILFFQ GKYKKDKLLF YYCIILISMF FLFCLLLKWQ PWNSRLHLPI FVLFSPFSGS
VFSRFKSRSF ITFLSIFLII SSFPWVFFNT SRPIISNLKT ESILTTSRTN QYFNNWPTIK
NPYLKATGYI NSQECSKIGL TRDFQVWQYP LFIFIKPTTS NPLEIRNINV TNISSEKMKV
EPYKHFIPCA IISLNFPDQK KKISKEINTN TGTYSIKFSS RLINVFMKQ