Gene Cphamn1_0256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0256 
Symbol 
ID6373911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp246928 
End bp248256 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content49% 
IMG OID642682770 
Productpentapeptide repeat protein 
Protein accessionYP_001958706 
Protein GI189499236 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.306319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA CACTGATTTT TTCGAGTATT CTTCTGGCAT TGCAGGGAGG GCCTGCCTGC 
AATACGACCT TCGGATTTGA TTCCGCCACT CTTAAAGAGC TTCTTTCCGG TGTTGGCGAA
TGGAACTCCA TGCGTTCCGG ACAACCATCG AGAACCATAA ATCTGGACAA GGCGACTCTC
GAGGATGCAA CTCTTGTAAA TGCCGACCTG CACAATGCGA GTATGGTAAA CACAAGGCTT
AACGGCGCGA AGCTCAATGG AGCAGATTTC CGTAATGCAA AGCTTTTTTC AGCAAGCCTG
AAAAGAACAG ATCTCAAGCA AACAGATCTG AGCGGAGCAA ACCTACGGGG AGCCGACCTG
AAAAATTCCT ATGCGAAAGA AGCAAAATTC ATTAACGCTG ACCTTACCGG TACCGATTTT
CGATATGCAA ATCTTGAAGG AGCAGACCTG ACAGGTGCTG TTCTTGAAAA CGCTCTCTTT
TTTGATGCCA ACCTCAGCTC TGCTGATCTC AGGGGAGTAA ATCTGACCGG AGCAAAAATG
CTCGGACAGG CAACGCTTCT CAATGCAAGA ATTTCAAACA ATACCATTCT TCCGTCTGGA
AAGCGAGCCA CACCTCTCTG GGCTTCACTA CACGGAGCCC GCTTCTCGAA AGAGACCGAA
CGTTCGCCGG TTGTCATGAA ATATGAACCG CTTCCGCCCC CTGTGGTTTC CGGCAACGCT
ACTGAAAGTG ACCCGGAGTC CATTACCACC GAAAATGTTT CCGAAACACT CCTCATGGAG
GATGTCACCG CATGGAACGA ACTTCGAAAA CAATACCCCG AAATGGAAAC CGATCTTCAG
GATGAAGACC TCGATGATGC CGGTTTGAAA GGTGCTGACA TGAAAAAGCT CGACATGACA
AGCTCAACCA TGAACGGAGC GAAACTGGAT CATGCCGACT TTTCAGAGTC TGACCTGTCG
AGTACGTCAT GGAAAAGAGC AAGCCTGGTA GAAACCGTTT TCCGTAACGC GAACCTTCAG
GGAGCCGATT TTAACCGTGC ATTCATGAAA AAAGCCGACC TGAGCGGCGC CGATCTGACT
GGCGCGCAGC TCCGTGAGAC AAGACTTCAG GAAGCTGATC TTAAAAAATC AAATCTTTCA
AAAACCAATC TCTACGATAC CGATCTCACG TGCGCCGATT TGAGGGGGGC TGATCTGACC
GGGGCCAATC TCCTCTATAC CATTTTGGAC AATGCCCTGA TTTCCGCTGA AACCATCACG
CCTTCAGGAG AAAAAGCAAC TACGGGATGG GCGGTATTGA AAGGTGCTAC GTTTGTTCGT
GAGAAGTAA
 
Protein sequence
MKKTLIFSSI LLALQGGPAC NTTFGFDSAT LKELLSGVGE WNSMRSGQPS RTINLDKATL 
EDATLVNADL HNASMVNTRL NGAKLNGADF RNAKLFSASL KRTDLKQTDL SGANLRGADL
KNSYAKEAKF INADLTGTDF RYANLEGADL TGAVLENALF FDANLSSADL RGVNLTGAKM
LGQATLLNAR ISNNTILPSG KRATPLWASL HGARFSKETE RSPVVMKYEP LPPPVVSGNA
TESDPESITT ENVSETLLME DVTAWNELRK QYPEMETDLQ DEDLDDAGLK GADMKKLDMT
SSTMNGAKLD HADFSESDLS STSWKRASLV ETVFRNANLQ GADFNRAFMK KADLSGADLT
GAQLRETRLQ EADLKKSNLS KTNLYDTDLT CADLRGADLT GANLLYTILD NALISAETIT
PSGEKATTGW AVLKGATFVR EK