Gene Francci3_1558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1558 
Symbol 
ID3904790 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1867985 
End bp1869622 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content72% 
IMG OID637878895 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_480663 
Protein GI86740263 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.931439 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCAC CGCACCCGCA GAACGACGCC GCCGACCACG CGCGGGTGCT TCCACCCCGG 
CTCGACCCGC GGGGATCGGG GTCCGGCCGG TCGGGATCCG GGTCGGCACG CGGCGGACGG
TCGGCTAGGC GTCGCGGGTT GCGGGTCTTC CTGGCCTGCG TGTCCTGCCT CGTGTTCGTG
GCCACGGTGG GGGGCTGGGC CGCGTACTTC TATGTCGACG GCAAGGTCAA CCGGGTCGAC
CTCGGCCTCG GTTCGGGCGG CGACTCCGGC GGGGTGGGGA CCGCGAACTA CCTGCTGGTC
GGGACCGACA GCCGCGCCGG TACCAACGGT AAGTACGGAG ACGCGCCGGG CCAGCGCTCC
GACACGACGA TCCTCGCGCA CCTGACGAAG GATGGGACCA CCACCATGGT CTCCTTCCCC
CGCGACATGT ATGTGCTGAT CCCCGAGTAC ACCGACGGGG CCGGCAAACG TCACAAGGCC
TGGCACGACA AGTTCAACGC CGCGATCTCC GTAGGCGGCC CGTCGCTGTT GGTGCAGACG
GTGCAGAGCC TGACCGGCCT GCAGGTCGAC CACTACATCT CGATGGACCT GGAGGGTTTC
AAGGAGATCA CCGATGCCAT CGGCGGTGTC CAGGTCTGCA TCAAGCCGTC GGACGCGAAA
CCGAGCCGCT ACCAGGACGA CCGCGGGCGG TGGCGGATGA GCACGAACAC CAACGACCCG
ATGAGCGGAT TCGTCGGTGG GCCCGGGACG ATCCAGCTCG GCGGCGACCA GGCCCTGGCC
TTCGTTCGCC AACGCCACGG CCTGCCGGAC GGCGACTTCG ACCGCATCCG CCGGCAGCAG
CAGTTCATCG GCTCGATGTT CCACAAGATC ATGGCAGACG ACACCCTGAC GAACCCGATC
AAGGCCCAGA AGCTGGTATC GGCGGCAGCG AGCGCGCTCA CCCTCGACAA CCACACCAGC
ATCGCCGACC TGCGCAGCCT CGGCACCCGA GTCCGCGGGC TGGCCACCGG GGGCCTCCAG
ATGCAGACCG TGCCCGTCCA CGCGCCGACC CGTGCCGAGG GGGCGATCGA CGACAACGGC
AACATCCGCC TGCACGGAGT CCCGGCCTCC GTGCAGCTCT ACCGGCCGGA CGACCTGCAG
CGCATCGTCG CGCCGATGGG TGGCAAGGCC GAGGGCGCGA GTTCTACCAC CGACGCCGGG
AGCGGGGGGA CCGGTCCGGC GCTCGCTCCC GGCGCCGCCG CCGCGCCCTC CCAGGTCCGG
GTCGCCGTCT ACAACGGCTC GTCGCGGGCC GGACTCGCCT CCAAGGTCAC CGAGCAGCTG
ACTGCCAAGG GCTTCCACGC CCGCAACGCC GGCAACGCCT CGGTCCTCAC CCACGAGACC
TCCCGGGTGC TCTACGCCCC CGGGCAGGAG GCCGAGGCGA ACACCGTCGC CGCGGCGGTG
CCCGGCTCGG TCCTGCTCGC CGATGCGAGC ATCACCGGTG TCCAGCTCGT CCTGGGCTCC
GGCTTCACCG CGGTGGTGAC GCCGAGTGTG ACGGCCGGTG GGACCGCGCC TGCGCCCGCA
CCGGCGGCGG CCGGACCCGC CGCCGGACCG GCGCCGCCGG GAGCGCCCTC GCCGGGCCCG
CCCACCTGCA CCTACTGA
 
Protein sequence
MTAPHPQNDA ADHARVLPPR LDPRGSGSGR SGSGSARGGR SARRRGLRVF LACVSCLVFV 
ATVGGWAAYF YVDGKVNRVD LGLGSGGDSG GVGTANYLLV GTDSRAGTNG KYGDAPGQRS
DTTILAHLTK DGTTTMVSFP RDMYVLIPEY TDGAGKRHKA WHDKFNAAIS VGGPSLLVQT
VQSLTGLQVD HYISMDLEGF KEITDAIGGV QVCIKPSDAK PSRYQDDRGR WRMSTNTNDP
MSGFVGGPGT IQLGGDQALA FVRQRHGLPD GDFDRIRRQQ QFIGSMFHKI MADDTLTNPI
KAQKLVSAAA SALTLDNHTS IADLRSLGTR VRGLATGGLQ MQTVPVHAPT RAEGAIDDNG
NIRLHGVPAS VQLYRPDDLQ RIVAPMGGKA EGASSTTDAG SGGTGPALAP GAAAAPSQVR
VAVYNGSSRA GLASKVTEQL TAKGFHARNA GNASVLTHET SRVLYAPGQE AEANTVAAAV
PGSVLLADAS ITGVQLVLGS GFTAVVTPSV TAGGTAPAPA PAAAGPAAGP APPGAPSPGP
PTCTY