Gene Franean1_7187 details


Gene Information

Locus tag: Franean1_7187
Symbol:
ID: 5675488
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8775000
End bp: 8778596
Gene Length: 3597 bp
Protein Length: 1198 aa
Translation table: 11
GC content: 76%
IMG OID: 641246024
Product: transcriptional regulator
Protein accession: YP_001511412
Protein GI: 158318904
COG category: [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
        [COG3899] Predicted ATPase
TIGRFAM ID:
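
The coordinate and length fields above are consistent with one another, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not; the record itself does not state either convention. The function name cds_lengths is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1  # inclusive span in base pairs
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Franean1_7187.
gene_length, protein_length = cds_lengths(8775000, 8778596)
print(gene_length, "bp;", protein_length, "aa")
```

Under those assumptions the result matches the listed Gene Length (3597 bp) and Protein Length (1198 aa).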


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGGCG TGCCGTACTA CCGCCTGTTC GGGTCGATCG AGGTCGTCCG GGACGGCCGG 
CCGGTCCAGC TCGGCGGCCC CAAGCAGCGG GCGGTGCTGG CCGCCCTGCT CCTCGATGCC
GGCCGGGTGG TCTCCGTCGA CCGCCTGGCC GGCGCCGTGT GGGGCGACGA GCATCCGCCG
AGCATGCTCT CCAGCCTGCA CGCGCACATC TCCAACCTGC GGCGGCTGCT GCGCGACGAC
GAGCGGGCCA CCTCGCCGAT CGTCCGCCGC ACCCCGGGCT ATCTTGCGGA CGTCCCCTCG
AACGACCTGG ATCTGCGGCT GTTCGAGCGG GAGTGCGACC GGGCCCAGGC CGCCGCCGAC
GCCGGGGACT GGCCGGACGC GGTGGCGGCC GCCGACCGGG CGGCGGCGCT GCGCCGGGGC
CCGCTGCTCG CCGAGTTCGG CGACGAGCCG TGGGTGCGCG GCGTGGCGAA CGCGGTCGAC
GAGCGGTGGG CGCAGTGCGA GCGGAGCGCG GTCGTCGGCC TGCTCGGCTC CGGGCGGATC
ACCGCCGCGG TGCTGCGCTC CCGCCAGCTG GTGCACGACG CCCCGCTAGC GGAACGCGCC
TGCCACCTGC ACATGATCGC GCTCTACCGG GCGGGGCGCG CGGCCGAGGC CCTCGACGCC
TTCCGCGACC ACGCCCGGCG GCTGGCCGAC GAGCTCGGTC TGGAGGCCAG CCCGGCGCTG
CGCGACCTGC AGGGCGCCAT CCTGCGCCAG GACCCGGCGC TGGATTCCTG GCCAGCCTCC
CCCCGCACCG CCGACCGAGC CACTCCCACC ACCCCGGCAG CTCCCACCGG CCCAGCCAAC
GCTTCCACCG GCCCAGCCAC CACCGCCGCC GCGGCCAACC CCGCCAACCC GATCGCCGCC
AACCCGATCG CCGTCGGCGC GGCCCGCCCC GCCGCCCCGA CCACCATGGC CGGCAGCGGG
GCGTCCGAAG GCGAGGGAAC GCCGGGAGCC CGGTACGGGG AGCTCGTCGG CCGGGTGCGC
GAGATCGCCG TGCTCGACTC GGTGCTGTCC GAGGCGATGA CCGGGCCCGT CCGCTGGGTG
GTTCTCACCG GCCCCGCGGG CATCGGCAAG AGCCGGCTCG CCGAGGAGGC CGCCGCCGGC
TGGCACCGGG CCGGCGGGGC GGTCTCGCGG ACTGGGTGCC CCGACGACGA CGCCGTCCCG
CCCTGGTGGC CGGTGCGCCA ACTGCTACGT GACCTCGGCG CCGACCCCGA CGATCTGCTG
ACCCCACCGA GCGGCGCCGA CGCTGACGCG GCCCGGTTCG TCGTCTACGG CCGGGTGCTC
GACGCGCTCT CCGAGGCCGC GCGGACCCGT CCGCTGCTGG TGGTGGCCGA GGACGTCCAC
TGGGCGGACA CCGCGAGCCT GCGGCTGCTC ACCCACCTCT CCGACGCCGG GGCGTGCCCC
GGGCTCGCCC TCGTCGTGAC CGCCAGGGAC GTCACCGGCC GCCCCGAGCT CGACCGGCTG
CTCGCCGCCG CGGCCCGCCG GCACGGATCA CGGCGGCTGG CCGTCCCACC GCTGACCGAG
GGCGAGGTGT CGGAGCTGGT CAACCGGATC AGCGGCCAGG CCATCGACGA CGCCGAAGCC
GCCGAGCTGG CCGACCGGAC CAGCGGGAAC CCGTTCTTCG TCTGCGAGTA CGCCCGCCTG
CCCGCCGAGG ACCGCGCCGG CGGAAAGGTA CCCGTCGCCG TGCGCTCGGT GCTCGGGCAG
CGGCTCGCCG TCCTCGATCC GGCGGTGCTC CAGGTGCTGC GCGCTGCCGC CGTCATCGGC
GACGTCCTGG ACATCGACCT GCTCGGCAAG GTGACCCGGC TCGATCGCGA CGAGCTCGCC
GACCTGCTGG ACGAGGCCTC GGACGAACAT GTGATCGTCC AGGCCGCCGG CACCGGGCGG
TACATGTTCG CGCACGCGCT GCTGCGCGAC GAGGTCGTCG CCGGGATCTC CAGCCTGCGC
CGCCAGCGGC TGCACCTGCG GGTGGCCGAG GCCCTCGGCC TGGTGGACGG TGGCCCAGTC
AACGGAGGCT CCGGCGGAGG CTCCGCCGGG GGCGAGGCGC TCTCCCGCCG GGCCGCGCAT
CTCGCCGCCG CCTGGCCGCT GGCCGAGTCC ACCGACGTGT TCGACGCCTG CCGCGCCGCC
GCCCTCGACG CGGAGCGCCG CTGGCAGTCG GAAGCCGCCG CGCACTGGTG GGGGCAGGCG
CTGGACGTCC TCGACCGGAG CGCCGGTGAT CTCGACATCG ACCGGGACGA GGTGCTCGTC
GCGCGGGTCA GCGCGCTCGC CCGCGCCGGG CGGGGCCAGA CGGTGCTCGA CGTCGTCGAC
GCCGGCCTGC TCGACGCGGT GCGCCGCGGG CGGCTCGACT CGGCCGGCCG GCTGGCCGCC
ACGCTGCTAC GGACCAGCGG ATCCTGGCCC TGGGCCGTCT ACGGCGACGA CCCGGCACCG
CTGCTGGCGC GCCTCGCCGG CCTGGAGACC CTTGTCGCCG CCGACCCGGC CGCGCATGTG
CGGGTGCTGG CCGCGCTCGC CGTCGGCAGC GCCTACGACC CGGACGGCTC CGTCCCCGAC
CTGCTCGGCC GCCGGGCGAT CGAGCTGGCC GAACGCATCG GTGACGACGA GGCCCTGGCC
GACGCGCTAC TCAGCCGGGC GCTGGCGTTC TCCGGCATCG CCGAACGGGC CGCGGAGTCC
GTCGAGCTAC TCAACCGGCT GGCCACCGTC CCGCACGCCA GCGCGCAGAT CGACGAGGTG
ATCGCGCACG GGCTGCTCTA CCTGGCGAAG ACGGCGCTGG GCGACCCGGG GTCCGCCGAG
CACGTCCGGC TCGGCGCGCT CGGCAGCGAC CTGCTGCGGC TGCCGGCCAG CCGGGTGCAG
TTCCGCTGGG CGCAGGGCTC ACTGGCGCTG TGGCGGGACG ACGACCTCTC CACGGCGGCG
GAGATCTACC ACCACGCCTT CGCCCTGCAC CGGGAGACCG AGCTCTACGA GAGCGGCGTG
TACCACCTCG CGCTGCTCGC TCTGTGCTGG GAGCAGGGCC GGCTGGACGA TCCGGACGAG
CCGGTGCCGA TCAGCCCGTT CGTCCCGTGG GCGCCGGCGC TGACCGCGGT CGCCCGCGGC
GACCCTGGGG CCGACAAGCT GCTCGCCGCG GAGATCGCGC AGGTCGAGCC GGTCACCTGG
ACCACCCACG CGCGGCTGAC GATGCTCGCC CACGCGGTCG CGGACCTCGG CCTGCGCTCA
CAGGTCAGCA CACTGACCGC GCGGTTGACA CCCGTCGCGC ACTGCGTCGC GAACATCGGC
CAGTGCGGCT TCGTCGGCAC GGTCGCGCTG GCCCTCGCCC GGCTGGCCGC GCTGGACGGC
GACCTTCCGG CCGCGCGGGG GCACCTGCGC ACCGCCGTGG AGGTCGCCAC CCGCGCGCAG
GGCGTCGGCG CGCTGCTGCG CTGCCGCCTG TTCGCCGCGG AGCTCGCCTC GCTCGCCGGC
GACCCCGTGG ACCTCGACGA CCTGCGCGAC GTCGCCGACC GCGCCGCACG CCGCGGCATG
ATCGGCGTAG CCCGCGACGC CCGCACCCTC CTCACCCGGC ACATCGACCC GACCTGA
 
Protein sequence
MRGVPYYRLF GSIEVVRDGR PVQLGGPKQR AVLAALLLDA GRVVSVDRLA GAVWGDEHPP 
SMLSSLHAHI SNLRRLLRDD ERATSPIVRR TPGYLADVPS NDLDLRLFER ECDRAQAAAD
AGDWPDAVAA ADRAAALRRG PLLAEFGDEP WVRGVANAVD ERWAQCERSA VVGLLGSGRI
TAAVLRSRQL VHDAPLAERA CHLHMIALYR AGRAAEALDA FRDHARRLAD ELGLEASPAL
RDLQGAILRQ DPALDSWPAS PRTADRATPT TPAAPTGPAN ASTGPATTAA AANPANPIAA
NPIAVGAARP AAPTTMAGSG ASEGEGTPGA RYGELVGRVR EIAVLDSVLS EAMTGPVRWV
VLTGPAGIGK SRLAEEAAAG WHRAGGAVSR TGCPDDDAVP PWWPVRQLLR DLGADPDDLL
TPPSGADADA ARFVVYGRVL DALSEAARTR PLLVVAEDVH WADTASLRLL THLSDAGACP
GLALVVTARD VTGRPELDRL LAAAARRHGS RRLAVPPLTE GEVSELVNRI SGQAIDDAEA
AELADRTSGN PFFVCEYARL PAEDRAGGKV PVAVRSVLGQ RLAVLDPAVL QVLRAAAVIG
DVLDIDLLGK VTRLDRDELA DLLDEASDEH VIVQAAGTGR YMFAHALLRD EVVAGISSLR
RQRLHLRVAE ALGLVDGGPV NGGSGGGSAG GEALSRRAAH LAAAWPLAES TDVFDACRAA
ALDAERRWQS EAAAHWWGQA LDVLDRSAGD LDIDRDEVLV ARVSALARAG RGQTVLDVVD
AGLLDAVRRG RLDSAGRLAA TLLRTSGSWP WAVYGDDPAP LLARLAGLET LVAADPAAHV
RVLAALAVGS AYDPDGSVPD LLGRRAIELA ERIGDDEALA DALLSRALAF SGIAERAAES
VELLNRLATV PHASAQIDEV IAHGLLYLAK TALGDPGSAE HVRLGALGSD LLRLPASRVQ
FRWAQGSLAL WRDDDLSTAA EIYHHAFALH RETELYESGV YHLALLALCW EQGRLDDPDE
PVPISPFVPW APALTAVARG DPGADKLLAA EIAQVEPVTW TTHARLTMLA HAVADLGLRS
QVSTLTARLT PVAHCVANIG QCGFVGTVAL ALARLAALDG DLPAARGHLR TAVEVATRAQ
GVGALLRCRL FAAELASLAG DPVDLDDLRD VADRAARRGM IGVARDARTL LTRHIDPT