Gene Cagg_1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1871 
Symbol 
ID7266362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2291945 
End bp2293699 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content57% 
IMG OID643566708 
ProductFibronectin-binding A domain protein 
Protein accessionYP_002463202 
Protein GI219848769 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0246704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTTCG ATGCTCTTAC CCTTGCCGCC GTTGTTGATG AGTTACGGGC AACAATTTTG 
TTTGGACGCA TCCAGCGCGT TCTACTCCTT GGGCCGTTGA CGATCGGCTT GGAGGTCTAT
GCTCACGGCC AACGTCGGTA TGTGATCGCC TCCGCCGATG CGCAAACGGC GCGCATTCAT
TTGGTCAGTC AACGACTAAC CCGTGGCGTT GATGGTGAAA CGCCGTTTTT ACTCCTTATG
CGCAAATACA TACTCGGTGG GCGAATCGTG AATATTGAGC AACCGCCCTA CGAACGAGTC
GTTTTGTTCA GTATAACGAA ACCGGCGGAG TCGCGCAAAC GCTCACATTC ATCTGATCCT
GATGAAGTGA TCCTTGACGA ACTGGATGAT GATGAGACGG AGGAGACCGA CGAGTGGTTG
CACTGCGATC TCATTGTGGA GCCACAAGAT CGGCGGAGCA ACATCATACT GGTTGATGAT
AACAATCTGA TCCTCGATGC GATCAAGCGG GTGACGCCGC GGATGAGTAG TCGGGTGATT
ATGCCACGTC GGGTCTATAC GCTGCCGCCG GCGCCGGACA AGCGTGATCC GTTGCGGGCC
AATGCAGCCG AGATTGAAGC GATTATGGGT GTCGGTGATC CGGTAAAAGC GCTCGTCAAT
GCTTACCGTG GCATCTCGCC ACAGGTGGCG CGTGAGGTGT TAGTACGGGC CTGTGGCGTC
CTGCCGACGA GTGGAACGGG CTTGCCGACA TACACCATTG CGGCTCGGCT GCGTGAGATC
GTTGCCGTCC CACCCGAACC ACATCTGGTC AACGATGAGA CCGGGCCGAT TGCCTACGCA
CCCTACCGGC CACTTCACTT ACCCGGTGCG GTGCCGATGG CGAGTATGAG TGAGGCGCTG
GAAGTCTTCT ACACCGCGCG CCAACAACCA CTGGGGCGAG ATCGGCGACG GGCCGATCTC
GAGGCGCAGT TGCAGGCCAG TCGCGAGCAG CTTCTTCACC AACGGGAACA GATCGTCAAT
GAATTGGCGC GGGTCAACGA ACTCGACCGT CTGCGGTGGG AAGGTGAGAT GATCTTCGCT
TTTCTTCACC AATTACCACG CGGTGCAACA GAATTGGTGG TAGAGGGTGA ACGGATCGCC
CTCGATCCAA ACCGGTCGCC AGTTGAACAG GCACAAGAAC GGTTTAAGGC GTATGAAAAA
GCGAAGAGTG CTCGTGCCAT CTTGCCGGCA CGACTCGCCG AAACAGAGAA CCGGCTGGCC
GGTCTCGACC AATTGTTGGC GATGTTAGCC ATTGCCGACG ATGCTACCCA GATTGACCTG
ATCGCGGCTG AAGCGGAAGA AGCCGGGTAT CTCCGCACAT CGGCGGTGCG ACGACAACGA
CCACGTGGCC GACCACTACA GGTACGTTCA AGCGACGGCT TACCGATCTA CATTGGGCGA
ACAGCTCGCC AGAACGAAGA AGTCACCTTT CGTATCGCTC GTCCCACCGA TCTCTGGCTC
CACGCCCGCA ACATCCACGG CGCCCACGTC ATCATTCGTG CCGACCATCC GCCCGAACGA
ACGATTGCCG AGGCAGCAGC ACTCGCAGCG TATTACAGTC AGGCGCGAGA TGATACGGCG
GTAGAAGTTG ACATTTGCCA ACGACGAGCC GTGCGCAAGA TTCCCGGTGG ACCAACCGGA
TTGGTCAGCT ACTATCCCGA ACGCACAGTA CGGGTAAAGC CGGAGCGGTG TGGTGAAGTG
GTACAAAAGT CGTAG
 
Protein sequence
MYFDALTLAA VVDELRATIL FGRIQRVLLL GPLTIGLEVY AHGQRRYVIA SADAQTARIH 
LVSQRLTRGV DGETPFLLLM RKYILGGRIV NIEQPPYERV VLFSITKPAE SRKRSHSSDP
DEVILDELDD DETEETDEWL HCDLIVEPQD RRSNIILVDD NNLILDAIKR VTPRMSSRVI
MPRRVYTLPP APDKRDPLRA NAAEIEAIMG VGDPVKALVN AYRGISPQVA REVLVRACGV
LPTSGTGLPT YTIAARLREI VAVPPEPHLV NDETGPIAYA PYRPLHLPGA VPMASMSEAL
EVFYTARQQP LGRDRRRADL EAQLQASREQ LLHQREQIVN ELARVNELDR LRWEGEMIFA
FLHQLPRGAT ELVVEGERIA LDPNRSPVEQ AQERFKAYEK AKSARAILPA RLAETENRLA
GLDQLLAMLA IADDATQIDL IAAEAEEAGY LRTSAVRRQR PRGRPLQVRS SDGLPIYIGR
TARQNEEVTF RIARPTDLWL HARNIHGAHV IIRADHPPER TIAEAAALAA YYSQARDDTA
VEVDICQRRA VRKIPGGPTG LVSYYPERTV RVKPERCGEV VQKS