Gene Cag_2000 details


Gene Information

Locus tag: Cag_2000
Symbol:
ID: 3747110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2535074
End bp: 2537176
Gene Length: 2103 bp
Protein Length: 700 aa
Translation table: 11
GC content: 50%
IMG OID: 637774537
Product: Fis family transcriptional regulator
Protein accession: YP_380291
Protein GI: 78189953
COG category: [K] Transcription
              [T] Signal transduction mechanisms
COG ID: [COG2203] FOG: GAF domain
        [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID:
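
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2535074, 2537176)
print(length)                     # 2103
print(protein_length_aa(length))  # 700
```

Both values agree with the Gene Length and Protein Length fields listed for this record.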


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.255571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCATCA CCCACTCTTT GCACGAGCTG CATCGGCTGA TGCGAACCAT CAGCAACGCC 
ATTGGCTCTC TTCACGATCC TCAAGAGCTA TTCCATACCA TTATAACTCA GCTCCGCCTC
TTTTTTGCTT TCGATTCTGC TGCCATTATT ACCCTCGACA ACAATCAGCG CCATTGTTCT
CTCTTTTTTG AAATGCTGCG TTTTCAATTG CCTGAGGGTT TTGTGCGTTA TCAACGCCCT
GTGGCAGGCT CGTGGATTGA GCCGCATGTG GAGGCGCAAA AGGTTGCCGT TATCCGTTTG
CGCGATTCAC TTGAGCATTT CCCCGTTGAT GTTGAGTTAC GCCAACATTT GCTTGAGCTT
GGATTAAAGC AAGCAGTGCT TTCGCCTTTG CGGGCAGGTG GCAAGGTGAT TGGTTTTTTG
ACCTTTGTTT TTCAAGAAGA GCAGGAGTGG CTTGATGAGG AGTGTGAGCT CTTAGCTACC
GTTTCCACGC CTATTGCCGT TGCCGTGAGC AATGCGTTAG CTTACGAGGA ACTCCGTAAG
CGGGAGGCGC AAACCGCTAT GCAGCTTGCC GTTAACAATG CGTTGCTTAC CATTCGTGAG
CGCAATAGCA TGATGCTTGC GGTGTGTGAG CAGGTGCAAA AGTTGCTGCC TTGTACCTTT
CTTGGCATTA GGGTGATGGA TAAGGAGGGC AACTATCGCT TGTACGATAA CTTTCGTCTT
CAGCCTGATG GAGCCTTTGC GCCTGTAACT GCGGCACTTG ATATGGAGAA TTACGACTCC
ATTGCCCGTG CAAGTTTTGA TTTAGCCATC ATTGGTGGTT ACTACGGCGG CAACAGCTTT
CTTGAGCTTT GCGCTCGCTT TCCAATTTTA GAGCATGTTC GCCAAAAATA TGGCATTTGC
TCATTCTTGA GCCTTCCGTT ATGGGAGCTG CCAGCCAGCC GTGCTGGGCT TATTATTTCT
ACCACTATGC GCCCTTTTGA TGGGCACGAT CAAGCAACGG CGAATCTTAT TGTGCCGCAA
CTTTCGCTTG CCTTGCAAAA CTATCTTGCC TTTGAAGAAA TTAACGAGTT GCGCCATAAG
CTTGAGGGTG AACGAAGCTG CCTTATTGAA GAGATTACGG CAACGCATCA TGTTGGGGAA
ATTATAGGGA GCAGCTCTGC GTTAACTGCC GTGCTCTATC GCCTCCAGCA AGTTGCGCCC
ACCGATGCCA CCGTACTTAT TCAAGGCGAA ACGGGCACAG GTAAAGAGCT TTTTGCGCGT
GCCTTGCACA ACCTTTCAGC CCGAAGCAAA CGCGCTCTTA TTAAGGTGAA CTGTGCCACC
CTTCCCGCCT CGCTTATTGA GTCGGAACTT TTTGGACATG AAAAAGGGAG CTTTACGGGA
GCCACAGAAC GGCGTATTGG TAAATTTGAG TTAGCCGAAG GGGGCACTAT TTTTTTGGAT
GAAATTGGGG AGCTGCCACT TGAGTTGCAG GCAAAGCTGC TACGCGTGTT GCAAGAGCGT
GAATTTGAAC GGCTTGGTGG ACGCCACGTT ATTCGTGCAA ATGTGCGCGT TATTGCTGCT
ACCAATCGCA ATGTAGAGCA AGAGGTTGCT TCAGGGCGAT TCCGCGAAGA CCTCTATTTT
CGCCTTAACG TGGTGCCACT TCACGTTCCA CCGCTTCGTG AACGGCGCGA TGATATTGCT
CTGCTTGCCA ACCACTTTGC GAACCGCTAC GCCCGTGAGT TTAGCAAGCC AAACCGTCCC
ATCCGCCAAA ACGATATGCA ACGCTTGCTC GGGCGCGAAT GGAAAGGCAA CATTCGTGAA
CTTGCCCACA TGGTTGAACA AGCCGTTATT CTCTCGGAAG GCTCGACGCT TGATTTCGCA
ACCATTCTTG CTCCCCAACA AACACTCAAT GCTCCTACAA ACCGTGCTAT CTCCAGCCTC
CGCACTATGC GCGCCTTTGA GGAGGAAATG ATTGCGATGG AAAAGCAGCT TATTCTTGAT
ACTCTTGAAG CAACGGGTGG ACGGGTAAGT GGCTCTGGAG GTGCTGCCGA ACGGCTTGCT
ATGCACCCCA AAACGCTTTA TACCCGTATT GCAACGTTAG GATTAAGCAA GCGGTATGGA
TAG
 
Protein sequence
MPITHSLHEL HRLMRTISNA IGSLHDPQEL FHTIITQLRL FFAFDSAAII TLDNNQRHCS 
LFFEMLRFQL PEGFVRYQRP VAGSWIEPHV EAQKVAVIRL RDSLEHFPVD VELRQHLLEL
GLKQAVLSPL RAGGKVIGFL TFVFQEEQEW LDEECELLAT VSTPIAVAVS NALAYEELRK
REAQTAMQLA VNNALLTIRE RNSMMLAVCE QVQKLLPCTF LGIRVMDKEG NYRLYDNFRL
QPDGAFAPVT AALDMENYDS IARASFDLAI IGGYYGGNSF LELCARFPIL EHVRQKYGIC
SFLSLPLWEL PASRAGLIIS TTMRPFDGHD QATANLIVPQ LSLALQNYLA FEEINELRHK
LEGERSCLIE EITATHHVGE IIGSSSALTA VLYRLQQVAP TDATVLIQGE TGTGKELFAR
ALHNLSARSK RALIKVNCAT LPASLIESEL FGHEKGSFTG ATERRIGKFE LAEGGTIFLD
EIGELPLELQ AKLLRVLQER EFERLGGRHV IRANVRVIAA TNRNVEQEVA SGRFREDLYF
RLNVVPLHVP PLRERRDDIA LLANHFANRY AREFSKPNRP IRQNDMQRLL GREWKGNIRE
LAHMVEQAVI LSEGSTLDFA TILAPQQTLN APTNRAISSL RTMRAFEEEM IAMEKQLILD
TLEATGGRVS GSGGAAERLA MHPKTLYTRI ATLGLSKRYG
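
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of NCBI translation table 11 (which are the same as the standard code; the tables differ only in permitted start codons). The names `CODON_TABLE`, `translate`, and `gc_fraction` are hypothetical helpers written for this illustration, and whitespace in the input is ignored so that the 10-base blocks shown above can be pasted directly.

```python
from itertools import product

# Amino acids in NCBI order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove whitespace (block spacing, line breaks) and uppercase."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGCCCATCA CCCACTCTTT GCACGAGCTG"))  # MPITHSLHEL
# Last 18 bases, including the terminal TAG stop codon.
print(translate("AGCAAGCGGTATGGATAG"))                # SKRYG
```

Running `translate` on the full gene sequence and `gc_fraction` on the same input would be one way to cross-check the Protein Length and GC content fields of this record.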