Gene Francci3_0711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0711 
Symbol 
ID3903501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp813609 
End bp815189 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content71% 
IMG OID637878044 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_479824 
Protein GI86739424 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCCGC CGGCCCTGCC GTCCGCGTTG GACCCACGTG GGCCCCGTCG CCACCGCTCC 
CCGCTGCGAC GGCTGTCCGT TGGAGCCGTC GCTCTGCTGT CCGTGCTGGT TCTCGGTCTG
AGCACCATCG GGTGGGCCGC CTACCGACAG TTCGACAAGG CCGTCACGCG CGTCGACTGG
GACATCGAGG GTGCCCGCCC GGCGAGCGCG GACGGCGAGG AGAACGTCCT GCTGCTCGGC
GACGACAGCC GGGAGGGCAC CGGGGGTGAG TACGGCGTGG TGGACGGCGT CCGGTCCGAC
ACCACCATCA TCGCCCACTT CGGCAAGGAC GGCTCGGCCA CGTTACTGTC CTTCCCCCGG
GACATGCTCG TCCCGGTGGT GCCGCGGGAG AAGGCCACCG CCCACGACGG CCGGTCGAAG
CTCACCGAGG TGCTCGGGCT GGCGGACGTC CCCGGTCTCG TCACCACGCT GGAGTCCCTG
ACCGGCCTGA AGATCGACCA TACCATCTCG ATCAATCTCG CCGGTTTCAA GACGATGACC
GACGCGGTGG GAGGCGTGAG CGTGTGCGTG ACGCCCCTAC CGAACGGCAG CACCCGCAAC
CTGCACGACT CGATGTCGGG ATGGAGCGGA CGGCTCGGCG AGAACCGTCT CAACGGTGAT
CAGGCGCTGG CCTTCGTCCG GACCCGTTAC GCGCTCGGGG ACGAACGCCT TCGCATTCTG
CGCCAGCAGC AGTTCCTCTC CAAGCTGCTG GCGACAGCGA CCAGCAGCGG AGTGCTCACC
AACCCAGCCA AGATCACTTC ACTGATCGGT GCGGTCGGCA GCGCGCTGCG GATCGACCAG
GGTCTCGACC AGACCGCGAT GCTCAAGCTC GCGAAGCGGG TCAGCGAGCT GGGTCCGGGA
AGAATTCACT TCGTGACAGT CCCTACGCAC ATCGCGCTGC GCTCCGATGG CGCGGTTGAC
GACCTGGGTT CGATTCCCCC GCACGGGGCC GTACTCATCG TCGATCAGGC CGGCCTCGAC
CAGGTCCTCG CTCCGCTGCT GCCCGCGGGC ACCAAGCCTC CCGCACAGCG AACTCTCGAC
CCCGCCCAGG TCTCGATCGC CGCCGTCCGC AACGCCTCGG GACGTGCCGG ACTCGCCACC
GGCACGGTCG ACGGACTGCG GGCACGCGGC TTCACTGGGC CGATGACCGC CGCGACATCG
ACCCGCCAGA CTCTGACCGA GGTACGCCAT CCCCCGGGCC AGGAGGCTGC GGCACGCACC
CTGGCGGCGA CGATCCCCGG CAGCCGGATC GTCGCGGACG CCGGCCGCTC CGGGGCCGGC
CTCGTCCTCG TCCTCGGATC CACCTTCACC GGCCTGCCCA GCTCCGGCCT GCCCAGCGGT
GGGCTGCCGG GCGCGGCGGC AACCGTCGGC ACGAGGATCT CGACGACCTC GACCGCCGCC
GGTGGTGACA CCCGGGCGAC CACGATCGGC GGCACCGCGA CGCCGACCCC GGCGGCCGGC
GGGGCCGGCG TCGGCGGCAC CACAGGCACC GCGGCCGCTC CGGTCGGTCC CGTTCCCTCC
GACCCCTCGT GCACGCCATG A
 
Protein sequence
MRPPALPSAL DPRGPRRHRS PLRRLSVGAV ALLSVLVLGL STIGWAAYRQ FDKAVTRVDW 
DIEGARPASA DGEENVLLLG DDSREGTGGE YGVVDGVRSD TTIIAHFGKD GSATLLSFPR
DMLVPVVPRE KATAHDGRSK LTEVLGLADV PGLVTTLESL TGLKIDHTIS INLAGFKTMT
DAVGGVSVCV TPLPNGSTRN LHDSMSGWSG RLGENRLNGD QALAFVRTRY ALGDERLRIL
RQQQFLSKLL ATATSSGVLT NPAKITSLIG AVGSALRIDQ GLDQTAMLKL AKRVSELGPG
RIHFVTVPTH IALRSDGAVD DLGSIPPHGA VLIVDQAGLD QVLAPLLPAG TKPPAQRTLD
PAQVSIAAVR NASGRAGLAT GTVDGLRARG FTGPMTAATS TRQTLTEVRH PPGQEAAART
LAATIPGSRI VADAGRSGAG LVLVLGSTFT GLPSSGLPSG GLPGAAATVG TRISTTSTAA
GGDTRATTIG GTATPTPAAG GAGVGGTTGT AAAPVGPVPS DPSCTP