Gene PCC7424_4954 details


Gene Information

Locus tag: PCC7424_4954
Symbol:
ID: 7107020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 5502296
End bp: 5503357
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 36%
IMG OID: 643483166
Product: protein of unknown function DUF21
Protein accession: YP_002380176
Protein GI: 218441847
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
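
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(5502296, 5503357)
print(length)                     # 1062
print(protein_length_aa(length))  # 353
```

Both values agree with the Gene Length and Protein Length fields of the record.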


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAGAGC TAGTTATTGT TGCTCTCTTA GTCATGGTTG GTTCTGGTAT CTGTGCTTGC 
ACTGAAACAG CGATTTTATC CGTTTCGCCG ATTAAAGTTA GGGAATTATC CCAATCTGGA
CAAAAATCCG CCTCAGTTTT GTTAACGATT CGAGAAAATA TTAATCATCC TATTGCCACC
ATTGTCATGA TTAATAATCT GTTTAACATT TTTGGCAGTA TTTTTATTGG AAGTATAGCC
TCAAAAGTGT TAGGAAATAT GTGGCTAGGA TTATTTTCAG GAGTTTTTAC TTTTTTGATC
ATTATTTTTG CGGAAATTAT CCCTAAAACT TTGGCTGCTC GTTATGCTAC TCAGATAGCG
TTGTTTGTAG CGATTCCTCT CAAATTAATT ACTCAAATTT TTAAGCCCTT TACTGTCATC
ATTGAAACCC TGACCTTACC CTTTACAAAA AAAGATAAAC TTCCCAGTAC CAGTGAAGCA
GAAATTAAAA TTTTAGCGAG TATTGGTCGT CGGGAGGGGG TAATAGAAAA AGATGAATCA
GAAATGATTG AGCGAGTCTT TCAATTAAAT GATCTTAAAG CAGAAGATTT GATGACTCCT
CGAATTATTG TCACTTATCT CAAAGGAGAG TTAACCCTAG AGGAATGTCA AGATATAATT
TCTCATTCAG AACATACTCG AATTTTAGTC ATTGGAGAAA CCATTGATAA AGTCTTAGGG
ATAGCTTTAA AACATGAATT ATTAACCGCC ATTATTGAAG GAAAACAAAA GCAACCTATT
TCAACTTTTA CCCGTTCAGT GAATTTTGTT TCTCAAGAGA CTAAAGCCAA TGAATTACTA
AAAACGTTTC AGACATTAGG AGAACATTTA ATCGTCGTTC TTGATGAGTA TGGGGGAGTG
GCTGGCGTTG TCACTCTAGA GGATGTGTTA GAAGTTTTAA TAGGGGAAAT TGTCGATGAA
ACCGATAAGT TTGTCGATCT GCAACAAATC GCCCGACGGA AACGAAAAAT TTTATTAGAA
GCCAGAGGAA TACAACAACA AGAAATGATA CAAGTTTCCT AA
 
Protein sequence
MVELVIVALL VMVGSGICAC TETAILSVSP IKVRELSQSG QKSASVLLTI RENINHPIAT 
IVMINNLFNI FGSIFIGSIA SKVLGNMWLG LFSGVFTFLI IIFAEIIPKT LAARYATQIA
LFVAIPLKLI TQIFKPFTVI IETLTLPFTK KDKLPSTSEA EIKILASIGR REGVIEKDES
EMIERVFQLN DLKAEDLMTP RIIVTYLKGE LTLEECQDII SHSEHTRILV IGETIDKVLG
IALKHELLTA IIEGKQKQPI STFTRSVNFV SQETKANELL KTFQTLGEHL IVVLDEYGGV
AGVVTLEDVL EVLIGEIVDE TDKFVDLQQI ARRKRKILLE ARGIQQQEMI QVS
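
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the pipeline the database uses: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard NCBI amino-acid assignment, which translation table 11 shares for all 64 codons (table 11 differs only in its permitted start codons).

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First six codons of the gene sequence listed above.
print(translate("ATGGTAGAGC TAGTTATT"))  # MVELVI
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the listed protein sequence and the reported GC content.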