Gene Francci3_3496 details

Gene Information

Locus tag: Francci3_3496
Symbol:
ID: 3905230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4167916
End bp: 4169376
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 72%
IMG OID: 637880818
Product: cell envelope-related transcriptional attenuator
Protein accession: YP_482578
Protein GI: 86742178
COG category: [K] Transcription
COG ID: [COG1316] Transcriptional regulator
TIGRFAM ID: [TIGR00350] cell envelope-related function transcriptional attenuator common domain
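
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the 1461 bp include a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4167916, 4169376)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Both results match the Gene Length and Protein Length fields of this record.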


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.625789
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.409021
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCTGGC CTCCGGATCG TCCGGATCGT CCGGAACGGT ACGGGCCGAC GGCGGGACGC 
CGGGACGGCG AGCGGCCTGG CCGGTCCGTC CGTGACCCGC GGGACCCACG CCGGGCCCCT
GAGGAGGACT GGCCCGTCGG GCAGTGGCCG CGCCGGGAGC ACCGGCCCGC TCCGCGGGGT
CACGATGATG ACGAGCCGCG CGTACCCCCC GGCGCCTACC CCGATCCCTA CGGCCGGTAC
CCCATCGATC ACCCGGCGGC CGAGCCCCGG TATGACCGGC CGGCCCCGGA GGGTCCCGTG
GCACCCGGGC AGGGCGGCTT ACGGCGCGGT CTCACCGCCA TGGCCGCGGT GTTGGCGACC
CTCGTCCTGA TCCTGGCGAC CGGTGGCTGG GCCGTGCTGA AGCACTACGA CGGTAGGGTG
CATCACATCC CGCTGGCCTT CTCCGCCAGC GCGGACCGGC CGGCGTCCGC ATCGGGCGGA
ACGCAGAACA TCCTGCTCGT GGGGTCGGAC ACTCGGACGG GGACCAACGG CGAGTTCGGC
CAGGTCGAGG GGCAGCGGTC GGACACGACC ATCCTCGCCC ACCTCGACGG CGACGGTTCG
ACGACTCTCA TCTCCTTCCC CCGGGATCTG TGGGTGCGGA TTCCCGCGTA CACCGACGCG
GCGGGCACGC AGCACGCGGC GCAGCGGTCC AAGCTGAACG CCGCCTTCTC CTACGGCGGG
CCGTCCCTGC TCGTGGCCAC GATCGAGAAC CTCACCGGGA TCCGGGTCGA CCACTACGTT
CAGATCGACT TCATCGGCTT CCAGGGGATG ACGGACGCCC TCGGCGGGGT CACCGTCTGC
ATCAAGGAGC TTCCCCCCGA GCTGAAGGCA CGGGGTTTCG ACAACCTGCA CGACCATTAC
TCCGGGTTTT CCGGTCAGGT CGGCGAGAAC ACGCTGAACG GGGCGCAGGC CCTCTCCTTC
GTCCGGCAGC GGTATGGCCT ACCCGAGAGC GACATCGACC GCATCCGCCG CCAGCAGCAG
TTCCTCGGTG CCGTCTTCCA GCGGATCGCG TCGACGGACA CCCTGCTCAA CCCGGCGAAG
CTGCTCGGGG TGGTCGACTC CGCCACCTCG GCGCTGACGC TCGACGAGGC CACCTCCCTC
GCCGACCTCC GGTTTCTCGC GGTGCGGATG CAGTCGATCG GATCGGGCGG CGTCGCGTTC
ACGACGGTGC CGGCGGCAGC TGGCACCCGC GGGGGGCAGA GCGTCCTAGT TCCTGATCCA
GCCCAGCTGG GCACCTTCCT CAAGCCCTTC GGCGGTCGTG TCGCCGACGG GAGCTCCACC
GGCGCACTCC CGGCCGGCGC GGGGGGTGGC TCCTTCGCCG CGGTACCGGT GTCCGCCGCG
GTACCGGTGG TCTGGTCCCC GACCAGCGCG GCGGGAAGGT CCGTCCCGGG TGACGCGGGC
GGGGTGTCCT GCACCTATTG A
 
Protein sequence
MTWPPDRPDR PERYGPTAGR RDGERPGRSV RDPRDPRRAP EEDWPVGQWP RREHRPAPRG 
HDDDEPRVPP GAYPDPYGRY PIDHPAAEPR YDRPAPEGPV APGQGGLRRG LTAMAAVLAT
LVLILATGGW AVLKHYDGRV HHIPLAFSAS ADRPASASGG TQNILLVGSD TRTGTNGEFG
QVEGQRSDTT ILAHLDGDGS TTLISFPRDL WVRIPAYTDA AGTQHAAQRS KLNAAFSYGG
PSLLVATIEN LTGIRVDHYV QIDFIGFQGM TDALGGVTVC IKELPPELKA RGFDNLHDHY
SGFSGQVGEN TLNGAQALSF VRQRYGLPES DIDRIRRQQQ FLGAVFQRIA STDTLLNPAK
LLGVVDSATS ALTLDEATSL ADLRFLAVRM QSIGSGGVAF TTVPAAAGTR GGQSVLVPDP
AQLGTFLKPF GGRVADGSST GALPAGAGGG SFAAVPVSAA VPVVWSPTSA AGRSVPGDAG
GVSCTY
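
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how the blocks might be joined and a GC percentage computed; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence shown above (six blocks of ten bases).
first_line = "ATGACCTGGC CTCCGGATCG TCCGGATCGT CCGGAACGGT ACGGGCCGAC GGCGGGACGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence would allow the reported Gene Length and GC content fields to be cross-checked.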