Gene Cag_1728 details

Gene Information

Locus tag: Cag_1728
Symbol:
ID: 3746511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2244347
End bp: 2245516
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 49%
IMG OID: 637774265
Product: pentapeptide repeat-containing protein
Protein accession: YP_380022
Protein GI: 78189684
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
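
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2244347
END_BP = 2245516
GENE_LENGTH_BP = 1170
PROTEIN_LENGTH_AA = 389


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the range spans 1170 bp, which is 390 codons, or 389 residues plus a stop codon.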


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.952308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGCTAA CGTTTCCGCC AGCCGTTTCT TTACAATTCT TTAATATGAA CAATCAAAGC 
ATGATCTCCT CTTTTTTTCA ACGCATGGCG TTTACGGCAC TTGTGGCAAG CGCACTACCA
TCATCTCTTT TTGCTTACGA CCGTGCTCAT GTAACCTTGT TGCAACAAGG TGTTGCGGTG
TGGAATAACC AGCGTCAAGC AACCATGGGG CAAACGCTTG ATTTATCGCG AGCACCGCTT
GCTAAGGCAC AACTTGGCGA GGCGAATTTA GCGCATGTGT CGTTAAGCAG TGCTTTTTTA
CAAGCAGCAA ACTTGCGTGG GGCTAATTTG CAAGCGGCTA ATTTGCGTTG GAGTGTGCTT
GATGGTGCCG ATTTGCGCGA TGCGGTGCTG GTGGGTGCTC ATTTATTTGA AGCAAGCTTG
GTGAAAGCAG ATGCTCGTGG CGCTAATTTT AAGAGTGCAA CATCGTTGGA GCAAGCTGAT
CTTTCAGGCG CACTTGTTTC CAATAACACC ATTGTGCCAT CAGGTGAACG CGCTCACGGG
CAGTGGGCGT TACGCCATCA TGCTACGTTT GTGCAGGAGC CTGAGCGCCC GATTGCCTCT
ATTGCTTCAG CCATCTCATT TTCTCCCGAA CGTACCATCA CCTCACCGCC TAACTCTGCG
CCAACAACGG TATCTCAATC TGCGGTAACT GCTCACCCCT CAAACGTTGT GCCATCACCA
CAAGCATCAG CGCAAGCGCC AATAACGAAA GAGTATGCCC GTGCTACGCT GAACGGGGTC
AATTGGAGCA ATGCGGATTT AGCAGGTGCT AATTTTTATA AGGCTGATAT GAAAGGTGCC
CAGCTACAAG GTGCAAATTT GCAAGGCGCT CATTGTGATC GGGCGTTTTT GCTTCAAGCC
AATTTACAAG GTGCTAACCT TACGAAAGCA CTGTTGTTTG GCGCTACACT CGACAAGGCT
GATTTAAGAA ATGCTAATTT AACGGAAGCG TCGCTTTTTG GGGCAAATTG TGAGGGAGCT
GATTTGCGTG GAGCTATTTT AACGAGGGCA AACGTAACGG ATGCGGTGTT GACTAATGCG
CTTATTTCTT CCACCACCGT GCTGCCATCA GGTAAAGCGG CAACACGGCA GTGGGCGCTC
ATGCAGCAAG CTATCTTTAG TCAGGATTGA
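
A GC percentage like the one in the GC content field can be computed directly from a sequence printed in 10-base blocks. The following is a minimal sketch and not necessarily how the database derived its value. The function name `gc_content` is hypothetical, and whitespace between blocks is simply discarded.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and newlines between 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence above.
    print(gc_content("ATGCGGCTAA CGTTTCCGCC"))
```

Pasting the full gene sequence into the call gives a value that can be compared with the rounded figure in the record.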
 
Protein sequence
MRLTFPPAVS LQFFNMNNQS MISSFFQRMA FTALVASALP SSLFAYDRAH VTLLQQGVAV 
WNNQRQATMG QTLDLSRAPL AKAQLGEANL AHVSLSSAFL QAANLRGANL QAANLRWSVL
DGADLRDAVL VGAHLFEASL VKADARGANF KSATSLEQAD LSGALVSNNT IVPSGERAHG
QWALRHHATF VQEPERPIAS IASAISFSPE RTITSPPNSA PTTVSQSAVT AHPSNVVPSP
QASAQAPITK EYARATLNGV NWSNADLAGA NFYKADMKGA QLQGANLQGA HCDRAFLLQA
NLQGANLTKA LLFGATLDKA DLRNANLTEA SLFGANCEGA DLRGAILTRA NVTDAVLTNA
LISSTTVLPS GKAATRQWAL MQQAIFSQD
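
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, assuming the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. It does not handle the alternative start codons that table 11 allows, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the NCBI compact notation: bases in the order
# T, C, A, G, and one amino acid letter per codon ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become "X".
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGCGGCTAA CGTTTCCGCC AGCCGTTTCT"))  # prints MRLTFPPAVS
```

The first 30 bases translate to MRLTFPPAVS, which matches the first block of the protein sequence listed above.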