Gene Cag_1228 details

Gene Information

Locus tag: Cag_1228
Symbol:
ID: 3748261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1629314
End bp: 1630963
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 47%
IMG OID: 637773761
Product: Fis family transcriptional regulator
Protein accession: YP_379532
Protein GI: 78189194
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID: [TIGR01817] Nif-specific regulatory protein
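
The coordinate and length fields in this record agree with each other. A minimal sketch of that arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon (neither convention is stated in the record; the function names are illustrative only):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1629314, 1630963)
print(length, protein_length(length))  # 1650 549
```

Under those assumptions, the result matches the 1650 bp gene length and 549 aa protein length listed in the record.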


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTATTA CACAACAAAA CGACTACGGC AGCATCAGTC TTTTAGCTGA AGTAAGCCGC 
ACCATTACGC ATGAAGAGGA CATTAACAAA GTGTTACGCT TAGTGCTCTT TATTATGTCG
GAAAACATGC ACATGCAGCG AGGTATGATT ACCATTCTTA ACCGCAGCAC GGGCGAAATT
GTTATTAACG AATCGTTTGG GCTAACCGAA GAGGAGCGTG AGCGTGGACG CTATCACATT
GGCGAGGGTG TTATTGGGCA TGTTGTAAAA ACCGGCAAAG CCGTTATTGT GCCCTCCATT
CAAGATGAAC CGCTTTTTCT TGATAGAACA GGATCGCGTG CTCAAGCAAA AAAAGAGGAA
CTCTGCTTTA TTTGCATTCC GATTAAAGCA GGCTCCGAAA TTATTGGTAC CCTCAGTGCC
GACCGCCACG TAGAACCCGT TACTGCCGAT GAGCATCGCC GCAAAACACG CCAAACCGAC
GACCGTGATG AGCGGATTGA TAAACTCCAA TTTTATGTGG ATCAGCTCTC CATTATTGCC
GCCATGATTT CGCAAGCCGT GCGCTTAAAG CAGCTTGCCT ACGAAGCAGG CAGCAAAAAC
GTGCAAGATT TGAGCAACTT GCCACCTTTT GCAGCCTCCA TTCCACCCCG CAGCGAAGAT
CGCCAAGTTA ACGCCATTCC GCCGCCATCG CCCGATCGCC CAGCAAACAT TATTGGCAAC
ACCAAGCCAA TGGTTTCGCT TTATTCCATG ATTGATAAAA TTGCCAAAAC CAGTGCCACC
ACGCTTGTGC TTGGCGAAAG CGGCGTTGGT AAAGAGTTAG TAGCAAGCGC CATCCACTTT
AAAAGCCGCC GCGCTGAAAA GCCCTTTATT AAATTTAACT GTGCAGCCCT TCCCGAAAAC
ATTGTAGAGA GCGAACTTTT TGGGCACGAA AAAGGAGCTT TTACAGGTGC CCTTGCTACA
CGGCATGGAC GCTTTGAAAT GGCAAACAAC GGCACTATTT TTCTTGATGA AGTGGGCGAA
CTCAGCTTAT CAGTTCAAGC GAAATTGTTG AGAATTTTGC AAGAAAAGGA GTTTGAGCGC
GTTGGCGGCT CAAAAACCAT TAAGGTAGAT GTTCGTGTTA TTGCTGCCAC CAACCGTAAC
CTCGAAGAGC TTATTCGGCA GGGGCAGTTT CGCGAAGATT TATTCTATCG CCTCAACATT
TTCCCCATTA CCGTACCACC CTTGCGCGAG CGTAAAACCG ATATTTTGCT GCTTGCTGAC
TATTTTGTTG AGAAGTATAA CAAAGCCAAT CAGAAAGGAG TACGCCGAAT TTCCACCACA
GCCATTGATA TGCTTATGCG CTACCATTGG CCTGGTAACG TGCGCGAATT GCAAAACTGT
ATTGAGCGTG CGGTTATTTT AAGTGAGGAT AATGTGATTC ACGGCTACCA CTTGCCGCCA
ACCTTGCAAA CGGCGGAATC AAGCGGTACG CCTTACACGG GCTCATTGCA ACAAAAGCTT
GATGCTATTG AAAAGGAAAT GATTATTGAA GCGCTTAAAC GCACGCAAGG CAATATGTCG
CGTGCTGCCA TGCAGCTTGG ATTATCGGAG CGTATTATGG GATTGCGCAT TAAAAAGTTT
AATATTGATT ATAGGAAGTT TAGGGTGTGA
 
Protein sequence
MLITQQNDYG SISLLAEVSR TITHEEDINK VLRLVLFIMS ENMHMQRGMI TILNRSTGEI 
VINESFGLTE EERERGRYHI GEGVIGHVVK TGKAVIVPSI QDEPLFLDRT GSRAQAKKEE
LCFICIPIKA GSEIIGTLSA DRHVEPVTAD EHRRKTRQTD DRDERIDKLQ FYVDQLSIIA
AMISQAVRLK QLAYEAGSKN VQDLSNLPPF AASIPPRSED RQVNAIPPPS PDRPANIIGN
TKPMVSLYSM IDKIAKTSAT TLVLGESGVG KELVASAIHF KSRRAEKPFI KFNCAALPEN
IVESELFGHE KGAFTGALAT RHGRFEMANN GTIFLDEVGE LSLSVQAKLL RILQEKEFER
VGGSKTIKVD VRVIAATNRN LEELIRQGQF REDLFYRLNI FPITVPPLRE RKTDILLLAD
YFVEKYNKAN QKGVRRISTT AIDMLMRYHW PGNVRELQNC IERAVILSED NVIHGYHLPP
TLQTAESSGT PYTGSLQQKL DAIEKEMIIE ALKRTQGNMS RAAMQLGLSE RIMGLRIKKF
NIDYRKFRV
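
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the coding sequence. It is an illustration and not the database's own pipeline, and the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons; since this gene begins with ATG, a plain codon lookup is enough here.

```python
from itertools import product

# NCBI codon ordering: bases in T, C, A, G order, third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unspaced nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumes the sequence starts with ATG; alternative table 11 start
    codons are not specially handled.
    """
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listed above.
first_row = clean(
    "ATGCTTATTA CACAACAAAA CGACTACGGC AGCATCAGTC TTTTAGCTGA AGTAAGCCGC"
)
print(translate(first_row))  # MLITQQNDYGSISLLAEVSR
```

The translated first row matches the first two blocks of the protein sequence. Passing the full gene sequence through `clean` and `gc_percent` would give a value to compare against the GC content reported in the record.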