Gene Francci3_3296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3296 
Symbol 
ID3904082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3902288 
End bp3903724 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content69% 
IMG OID637880621 
Productputative transcriptional regulator 
Protein accessionYP_482382 
Protein GI86741982 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0979413 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCAGA CGATCGCCTT CCTGCGCGCG GCGGGAGGCG ACACCACCGA GGTGGAGGTC 
AAGTCTGCCG CCGGCGGGTT ACCCGCCTCG TTGACCTCTA CGCTGAGCGC GCTGGCGAAT
CAGCCCGGGG GCGGGACCAT CATCCTGGGA CTCGACGAGC GGGCCGGGTT CCGCCCCGTC
GAGCTCAGCG ACCCGCAGGT ACTCAAGCAG GGCCTGGCTG CCAGGGCCCG GGCATTCACA
CCGCCGGTCC GTCTCACGAT CGAAGACGGG GAGGTTGACG GGGCCGCGGT CGTGGTTGCG
CAGGTCCAGG AGTGCGACCG GTCGACCAAG CCCTGTCGCG TCACCGCGAC CGGCAGGGCG
TACCTACGTG GCTACGACGG CGACTACGCC CTGTCTGACG TGGAGGAACA GGGGTTCCTG
GCCGCTCGCC AGCCACCGCT GTTCGACCGT TCACCTGTCG AGGACGCCAC CATCGACGAG
CTGGACACCG AACTCGTCGA TGCTTTTCTA CTCGCTGTCC GCGAACGCGA CCCGGCCGGG
CTCGGCCGTT TTCCCGACGA CACCGAGCTC CTACGCCGGG CTGGGGTCAC GATGGATGGC
GGGCAGCCAA CTGTCGCGGG ACTGCTCGCT CTCGGGGTCC ATCCCCAACA GTGGTTCCCT
CGCTACGTCA TCCAAGCCGC CGCGCAGCCC TTGCCCACCG GCTCCGCCGC AACGCGGGCC
CGCAACCAGG TCACCATCAG CGGACCGGTC CCGCGGATGC TCGACGCGGC GCTGCTCTGG
GCCCGACATA CCTTCGACAC CGCCATCGTC GCCGAGATGG ACGGCAGCGT TCGTGACCGT
CCGATCTACC CACTCGTCGC CTTCCGTGAG CTGGTCGCCA ACGCGCTGAT CCACCGCGAC
CTCGATCACT GGTCCGCCGG GCTGGCCGTC GAAGTGCGGC TTCTGCGGGA CCGCCTGGTA
GTGACCAATC CCGGCGGCCT GTACGGCATC ACCGTCGACC GGCTCGGACG CGACGCGGTG
ACCTCCGCCC GCAACGCCAG CCTGGTCGCG ATCTGCCAGC ACGTCCGCTC TGCGCAGACC
GGAGCTCGGG TTATCGAAGC CCTCGCCAGC GGGATTCCCA CCGTCACCGA GGCTCTCGCC
GACTGTGGCC TGCCGTCAGC CCACTACGTG GACAGCGGCA TCCGGTTCAC CGTCATCCTC
CACCAGTTCG CGACCGCCAC GCCCGTGGCA ACCGCCGAGC CCCCGCTGGG CGCCACGGAG
CGTCGCGTCT ACCAGGCCCT GACCCGTCAG GGACGAACAG TCAGCGACCT CGCCGAAGAG
CTCGGGCTGT CCGCTCCGAA CATCCGCAAG GCCCTGCGAA ACCTGCGCGG CCGCGGGCTG
ATCCTTCAAC TCGGCGGCAG AGGCAGGGCC ACCACCTACC AGCGGACGGA CTCATAG
 
Protein sequence
MSQTIAFLRA AGGDTTEVEV KSAAGGLPAS LTSTLSALAN QPGGGTIILG LDERAGFRPV 
ELSDPQVLKQ GLAARARAFT PPVRLTIEDG EVDGAAVVVA QVQECDRSTK PCRVTATGRA
YLRGYDGDYA LSDVEEQGFL AARQPPLFDR SPVEDATIDE LDTELVDAFL LAVRERDPAG
LGRFPDDTEL LRRAGVTMDG GQPTVAGLLA LGVHPQQWFP RYVIQAAAQP LPTGSAATRA
RNQVTISGPV PRMLDAALLW ARHTFDTAIV AEMDGSVRDR PIYPLVAFRE LVANALIHRD
LDHWSAGLAV EVRLLRDRLV VTNPGGLYGI TVDRLGRDAV TSARNASLVA ICQHVRSAQT
GARVIEALAS GIPTVTEALA DCGLPSAHYV DSGIRFTVIL HQFATATPVA TAEPPLGATE
RRVYQALTRQ GRTVSDLAEE LGLSAPNIRK ALRNLRGRGL ILQLGGRGRA TTYQRTDS