Gene Cphamn1_0478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0478 
Symbol 
ID6374142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp501397 
End bp503367 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content53% 
IMG OID642682996 
Producthypothetical protein 
Protein accessionYP_001958923 
Protein GI189499453 
COG category[L] Replication, recombination and repair 
COG ID[COG1555] DNA uptake protein and related DNA-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.522287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCAA CCATTTTTTT AGCGGCATTT CTCCATCTAT TCATTTTTCA ATCCCTGCCG 
GTTTTCGCAG ACGACGATCT CGAAGCCCTC TTTGATCAAA GCGATATGCC CGGCGACATC
GAGCAGCTGC TTCTTGAACT GCAGGAGCTG AAGCAGAGGA AAATCCCTGT CAATAGCGCG
ACGGAAGAGG ACCTTCTGCT TATCCCGTTT CTCTCGAACG ACGATGCCCG CAGGATCATC
GAGTACAGGG AGAAGAACGG CCCCCTGACT TCTGTGGGGC AGCTTGCCGG GGTTATCGGC
AGTGACCTGG CGCGCAGGAT TTCACTGTTT CTCTCCTTTG AGTCCCCGAG GCTTATAGTT
CCTGAGAAAG CGGTTCCCTT TAGCGGAAAC TGGTACGGCA GATACTTCAG TGAAAGCCCC
GAGCGGAGCG GGATTCTTTC AGGGAAATAC GGAGGAGAGA GCTACAAGTT GTACAACCGC
TTGCAGGTGG TCAACGGGGG GATTTCGGTA AACGGGGTAA TGGAAAATGA TGTCGGAGAG
CCTGATATCG ACGACTTTAC CTCGTTGAGT GTCGCATACG ACGGTTCCGG GAGTTTCGAG
CGGCTGATAG CCGGTAACTA TACGGTCAAT TTCGGTCAGG GGCTGTTGTT CGGGCAGAGC
AGATACCTTT CAAAAGGGGT AGATCCTCTC GGGGTGAAGC TTTCCGGGCG TCGGCTCAAA
GCCTACGCTT CAAGTGCGGA AAACGGTTTT ATGCAGGGCG CGGCGACAAC TCTCAATCCG
GACCCGTTCA GGCTCACGGC GTTTTATTCC AGCAATCTGA TCGATGCTTC CGTGGAAGAC
GGAACAGTCA CCACCATCCG TACATCAGGC TACCATCGAA CTGAGAGTGA AATCGAGCAC
AAAGATAACG TGACTGAGCA GGCTGGAGGG GTGAACATCC TCTACACGCT TGATTCCGGG
CCGGTCAATG GAACTGTAGG TGGGACATGG GCGCGCTACC GCTATTCGAT GCCCCTTGAC
GATATCGAAG GCAGCGGGGA ATGGCTTGAT ATGGGAGGTG TCGAGGCTGA TCTGCTCATA
GGGAAGGTCA ATGTTTTCGC GGAAGCTGCT GTGACCGGCA AAGATCCCCG GCTCTCCTGG
ATCAGCGGAA TGCGTTTTCC GTTGACCGAT GATATCCGCA CTGTACTTGT GGTCAGAGAT
TATCATAACC GGTACTTTTC CCCCTTCGCT GGCGCTTTCG CTGAACGTGC GGATGACGCG
TCAAACGAAG AGGGCTATTA TATCGGTCTT GAAGCAAAAA TCCTGAAGAA CCTTCGCCTC
GGGGCCTACT ACGATATCTT CAGGTTTCCC GAGCTCAGCA GCCGATACCG ATTGCCATCG
ACAGGGGACG AAGCGAAAAT TTTTCTCACC TGGAAACAGT CCCCGGTGTT GACGACGGAA
CTGCTGTTGC AGAACCAGTA CAAGGAAGAG GCCAAAAAAC TTGAGGACGG ATCAGGTCGT
GAATATTACC AGCCGGTTCC CTTCAGGTCG AACCGCGCAC GCCTTGGCCT TATCGGAAAA
GTTTCCAGGT GGCTGACGCT CAAGACAAGG GGAGAGATCA AGTTTGTGGA TGGAGAGTAT
CCCGATGGTG ATGACCATTC CGAAGGGTGG CTGATCTATC AGCAGGCGAC GATACGCAAG
GATCCTGTCA CCTTCAAGGC CCGCTACACC AGGTTCTTTA CCGATGACTT CGACTCTGCG
ATCTATGTCT ATGAAGATGA CCTGCCGCTG GTCTTTACCC TGAAATCCTA CTATGGAGAG
GGACAGGCCG CTTTCGCGGT TGTTTCGCTT GATCTTCTCA AGAATTTCAA ACTCTCCGCC
CGATACGGCA AAACATGGTA TGACGATCGC GAGGTATACA GCAGCGGCAA CGACAAACGA
GAAACCAACG CCCCCGCGTC GTATCATCTC GGTTGTGCGT TACGGTTTTG A
 
Protein sequence
MRATIFLAAF LHLFIFQSLP VFADDDLEAL FDQSDMPGDI EQLLLELQEL KQRKIPVNSA 
TEEDLLLIPF LSNDDARRII EYREKNGPLT SVGQLAGVIG SDLARRISLF LSFESPRLIV
PEKAVPFSGN WYGRYFSESP ERSGILSGKY GGESYKLYNR LQVVNGGISV NGVMENDVGE
PDIDDFTSLS VAYDGSGSFE RLIAGNYTVN FGQGLLFGQS RYLSKGVDPL GVKLSGRRLK
AYASSAENGF MQGAATTLNP DPFRLTAFYS SNLIDASVED GTVTTIRTSG YHRTESEIEH
KDNVTEQAGG VNILYTLDSG PVNGTVGGTW ARYRYSMPLD DIEGSGEWLD MGGVEADLLI
GKVNVFAEAA VTGKDPRLSW ISGMRFPLTD DIRTVLVVRD YHNRYFSPFA GAFAERADDA
SNEEGYYIGL EAKILKNLRL GAYYDIFRFP ELSSRYRLPS TGDEAKIFLT WKQSPVLTTE
LLLQNQYKEE AKKLEDGSGR EYYQPVPFRS NRARLGLIGK VSRWLTLKTR GEIKFVDGEY
PDGDDHSEGW LIYQQATIRK DPVTFKARYT RFFTDDFDSA IYVYEDDLPL VFTLKSYYGE
GQAAFAVVSL DLLKNFKLSA RYGKTWYDDR EVYSSGNDKR ETNAPASYHL GCALRF