Gene PCC7424_4971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC7424_4971 
Symbol 
ID7107037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7424 
KingdomBacteria 
Replicon accessionNC_011729 
Strand
Start bp5523338 
End bp5524417 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content46% 
IMG OID643483182 
Productbiotin synthase 
Protein accessionYP_002380192 
Protein GI218441863 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value0.819396 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTCAAG CATCCCTATC TTCACCCATT ACCCCTCCTA AAGATACCCA AGCTTTACAA 
GAATGGCTTC ATCAACTGGC CGATCGCATT ATCTCAGGAT ATCGTATTAC TAAAGTAGAA
GCCCTGGCCT TAACCGAAAT CGAAGGGCAA GATCAGATAC TCCTATTGTG TGAAGCGGCT
GATCGCATTC GTCAAGCCTG TTGCGGTAAT AGGGTTGATT TATGTAGCAT TATCAATATA
AAATCCGGCC ACTGTTCGGA AAATTGTAGC TTTTGTTCTC AATCTGTCCA TCATCCGGGT
CAAGATTCCC CAGTCTATGG ACTCAAAACC TCAGAAGAAA TTGTACAACA AGCCAAAGCC
GCCGCCGCCG CCGGTGCTAA ACGGTTTTGT TTAGTCAGTC AAGGACGAGG ATTAAAATAC
AATAGCCCCA AATCTAAAGA ATTTGCAGAA ATTTTAGCCA CTGTAAAACG CATCACCACA
GAAGCCAAGA TCAAACCCTG TTGCGCCTTA GGAGAATTAA CCCTCGAACA AGCACAGGCA
TTAAAAGAAG CCGGTGTTAC CCGTTATAAC CATAATTTAG AAGCCTCAGC AACATTTTAC
CCCCAAATTG TCACGACTCA TACCTGGGCC GATCGCGTGG AAACGGTAAA AAATCTTAAA
GCCGCCGGCA TTCAAGCCTG TACCGGTGGC ATTATTGGTA TGGGAGAAAG TTGGGAAGAC
CGGATCGATT TAGCCCTATC TTTACGAGAC TTAGAGGTAG ATTCTGTGCC GATTAATCTC
CTTAACCCCA GACAAGGGAC TCCCTTAGGC CATCTCCCCA AACTTGACCC GTTTGAAGCG
TTACAGGCGA TCGCTATTTT CCGCTTTATT TTACCGCAAC AAATCCTCCG CTATGCAGGA
GGACGAGAGG CCATCATGGG AGAGTTGCAA AGTTTAGGGT TAAAAGCGGG AATTAATGCT
ATGCTAATTG GACATTATCT GACCACTTTG GGACAATCTC CCCAACAAGA TCAGGCCATG
TTAAAATCTC TAGGGTTAGA GGGAGGTGAA GCCCCAATTC CCGGTGAATA CCAACCCTAA
 
Protein sequence
MVQASLSSPI TPPKDTQALQ EWLHQLADRI ISGYRITKVE ALALTEIEGQ DQILLLCEAA 
DRIRQACCGN RVDLCSIINI KSGHCSENCS FCSQSVHHPG QDSPVYGLKT SEEIVQQAKA
AAAAGAKRFC LVSQGRGLKY NSPKSKEFAE ILATVKRITT EAKIKPCCAL GELTLEQAQA
LKEAGVTRYN HNLEASATFY PQIVTTHTWA DRVETVKNLK AAGIQACTGG IIGMGESWED
RIDLALSLRD LEVDSVPINL LNPRQGTPLG HLPKLDPFEA LQAIAIFRFI LPQQILRYAG
GREAIMGELQ SLGLKAGINA MLIGHYLTTL GQSPQQDQAM LKSLGLEGGE APIPGEYQP