Gene Cag_1875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1875 
Symbol 
ID3747027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2382485 
End bp2384926 
Gene Length2442 bp 
Protein Length813 aa 
Translation table11 
GC content49% 
IMG OID637774412 
ProductTPR repeat-containing protein 
Protein accessionYP_380168 
Protein GI78189830 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTC AGTTTGTAAA ACATAACCCC GCTTTTCTTG ATGCTGGGCA CTTTCTTGAG 
CAAGTGGTGG CGCGCCGTGC CGATGTAGCG CATTTGCTTG GGCATCTCGG TAGTTGTGAA
CCTATAGGTA CGGTTCGTCA TCTTTTTATT ACCGGTCAAC GTGGCAGCGG CAAAACATTT
GTGGTGCGCC GTGTTGCTTT AGCGGTTGAG CAACATAACG CTTTACGTAG CCGCTACTAT
CCGCTCTTTT TTTCTGAGGA GAGCTACTCT GTTAGTTCAT CAGCCGAATT TTGGCTTGAA
GCTTTATTTC ATCTTGCTCG CCAAACCGCT AATGAGCAGT TTGCCCAAAC CTATCAAGCC
TTGCGCGAAG AGGTTGATGA AGAGCGTATT CGCCAAATTG TGCTGCCGTT GTTGCTTGAT
TTTGCAGATA ATCAAGGTAA AACGCTCCTT TTGATTATTG AAAACTTCTC CATGTTGCTT
GCCGATATGG CAAGCAGTCG CGAAGGTGAA GTGCTTGCGC AAACGCTGTT GCAGGAGCCT
CGTTTTCAAC TCTTAGCTAC GGGAACTTTT ACTTTTGACA GCTTGGAGGT GCCTTTTGGT
AAATACTTTA GTAGCATAAC CCATCACGCG CTTGAGCCGC TCAGCAATGC TGATTGTAAT
GCCTTGTGGC AGCTTTATAG CGGTACGCCT TTAGCCGATG GGCAAATTAG AGCAATGAAC
ATTTTAGCGG GTGGAAATGC TCGTTTGTTG GTAACGCTTG CGCGTGTAGC TAACGGGCGA
ACCTTTAGCC AACTGCCTGA AATTTTGGCG CTTGCTCTTG ATGAGCATAC CGAGTATTGC
AAAAGCTATC TTGATGTGAT GGCGCCGGTT GAGCGCAAAG TCTATTTATC TATTGCGGAA
TTATGGGCAA TGGTTTCAGC GCGAGAAGTG TCATTAGCCG CACGTATTGA TATCAATAAA
ACCAGTGCCT ATCTCAACCG CTTAATTAAT CGAGGTGCTA TTAGCGTTGA GCGGCAGGCA
AAGCGCAATA AGCTGTATGG CGTAACCGAG CGTATTTATA GCATTTACTA TCTCATGCGC
CGCCATGGAT GGCAATCGGG GAGAGTGCGT GCGTTGCTTG ATTGTATGTT GGCATTTTAC
GATCCCGCCT CATTCCCAGA CCGCCTTTCC GATAGTGAGC GTGAGCGCTG CACCTCCGTT
GCTGAAGCAT TAACTATGTT GCCCGAGCGC GAGCATGTGG AGCAGTGCCA TCAAAATTTT
CTTGATGCAA AACGCCACTC CCGTTTTGTG CCATCTCTTG CCTATTTTGT TCGCTTTGTG
CAAAAGAGTG ATGTTTCTCA TGCTGATGAA TCATCTTCAC CGATGCTTGG TGAAAGCTTT
CGTCAAGCGT TTGAACTGCT TGAAACCGAA AGTTACACGG AAGCGCTCCC GATTTTTGAT
GCCATTATTA TGGTGTCGCG CCATAGCGAA AGCGAGCAAG CCATTGGGCA GCGTTATGGT
GCCATGATTG GGCGCGGCGT GGCGCTTGGC AATCTTGAGC GGTATGAGGA AGGATTTCGT
TTGCTTGATG AAGTGGCCGC AACCTGCCAA GAGCGTTCGG TGCGGCGCCG CTTAAAGTGG
GGATTGCTTG CATTGTTGGG CAAAGCAAGT GTACTTGAGC GAGCAGGGCG CATTGATGAA
GCCGTAACGT TGTACGATGA ATTGGTAAGC CGTTATCGCC GTCAGCAAGA ATTGGAATGC
TCCACCTTAG TGGCGGCAGC GCTTTTGCAC AAGTCGCTCT TAGTAAGCAA AAGTAAAGGT
GGAGAAGAAG AGATTGCCAT GTGCGACACC TTGCTTGAAT TGTATAGCGA GCGCGTCGAG
TTACCGTTAG TTGAGTTAGT GTGTGCAGCA TGGCGGAACA AAGCAATTGC GTTTGAAGCG
CTTAATCGTA ATGATGACGC ATTGCTTGCT TATGGAAAAC TGTTAGCGCT ATGCCGTCAG
CGAAGTGAAC CACATATGAT GCAGCACACG GCTCATGCGT TACAGAATAT GGGCGTTGTG
TATGGTAAAA TGCACCGTTA TAGTGATGCC GAGCACTGCT TTATGGAGGT GCAGAGCCTT
GCGCCGCAGC AAGCTCGTGC GCATCTTATG CTTTTAAAGT TGTTAGTAAA AATGGAGGAG
CAGCAGCACG CCGTGCTTTC GGAATTGCGC AACTACCTTG CTGCTACAAG TCTTGCCTTA
CGGGCGCTGC CTCAAACCAT TGAATTATTT ATTACCTGTG CTGTTGCAGG ATATGCGGCT
GAAGCATTAG AGTTGCTGGT GGCATCGCCT CTTGCGGTTT CACTGGAGCC TGTGCAAGCG
GCATTGCAAC ACGCTACGGG TGATGAAGTG CGAAGTGCTC CCACGATCGT AGAGGTTGCT
AACGATATTG TGGCAGCAAT TGAGGCGCGT CGTAACGCAT GA
 
Protein sequence
MSIQFVKHNP AFLDAGHFLE QVVARRADVA HLLGHLGSCE PIGTVRHLFI TGQRGSGKTF 
VVRRVALAVE QHNALRSRYY PLFFSEESYS VSSSAEFWLE ALFHLARQTA NEQFAQTYQA
LREEVDEERI RQIVLPLLLD FADNQGKTLL LIIENFSMLL ADMASSREGE VLAQTLLQEP
RFQLLATGTF TFDSLEVPFG KYFSSITHHA LEPLSNADCN ALWQLYSGTP LADGQIRAMN
ILAGGNARLL VTLARVANGR TFSQLPEILA LALDEHTEYC KSYLDVMAPV ERKVYLSIAE
LWAMVSAREV SLAARIDINK TSAYLNRLIN RGAISVERQA KRNKLYGVTE RIYSIYYLMR
RHGWQSGRVR ALLDCMLAFY DPASFPDRLS DSERERCTSV AEALTMLPER EHVEQCHQNF
LDAKRHSRFV PSLAYFVRFV QKSDVSHADE SSSPMLGESF RQAFELLETE SYTEALPIFD
AIIMVSRHSE SEQAIGQRYG AMIGRGVALG NLERYEEGFR LLDEVAATCQ ERSVRRRLKW
GLLALLGKAS VLERAGRIDE AVTLYDELVS RYRRQQELEC STLVAAALLH KSLLVSKSKG
GEEEIAMCDT LLELYSERVE LPLVELVCAA WRNKAIAFEA LNRNDDALLA YGKLLALCRQ
RSEPHMMQHT AHALQNMGVV YGKMHRYSDA EHCFMEVQSL APQQARAHLM LLKLLVKMEE
QQHAVLSELR NYLAATSLAL RALPQTIELF ITCAVAGYAA EALELLVASP LAVSLEPVQA
ALQHATGDEV RSAPTIVEVA NDIVAAIEAR RNA