Gene Franean1_6151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6151 
Symbol 
ID5674472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7483214 
End bp7484563 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content75% 
IMG OID641245003 
Productputative transcriptional regulator 
Protein accessionYP_001510401 
Protein GI158317893 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0764047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.610732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACGG ACAATGCCCA CGCCTGCACC CATCCCCTCG CCTTCGTCCG CGCTCAGCGT 
GGGTGGTCCT ACCAACGGCT GGCACGCGTC GTCGCGCGTC GGGCCCGAGA TCTCGGGGTC
GCGAACATGG CCGCCGAGCG GCAGAAGGTC TGGCGCTGGG AGCACCGCGG TGTTGTGCCG
GACCGGGTCT CCCAGCTGGC GCTCGCCGCC GAGCTCGGGG TCCCGACCGA CCGGCTGGAG
TCCCACCCGT GGCCGTCCTG GCTGCCGACC GGTGACGCCG TGCGCACGGA GTACCCGTGG
ACGGCCTCGG GCAGCGTCAC CTCGCTCATG GACGTCGTCG AGGACGCGCT GACCGACCGC
CGCGGCTTCC TGACCATCAC CGGGCCCGGT GTCGCGTCGC TGTCGTCGGA GTGGCTCGGC
CTGGAGCCGG CCCGGCTGCA GGTCGCGCTC GCCGGCGGCC AGGTGGACGA GCAGATCGTC
AACCGGATCG AGCACAACAT CCCCGGCCTG CGGGTGATGG ACGAGCGTCT CGGCGGGGAG
AGCGTGCGGC GGCTGGTGGA CGCCGAGCTC GGCGTGGTCG CAGACCTGCT CGCCCGCGGC
TCCTACACCG AGGCGATCGG CCGGCACCTG CACCTGGTCG CGGCCGAGCT CGCCCGGTTC
GCCGGCTGGG TCTCCTTCGA CGCAGGCTTC CAGACCGCGG CGCAGCGGTA CTGGGTGACC
GCGCTGCACG CCGCGCACGC CGCGGGGGAC CGGATGCTCG GCGCGAACGT CCTGAAGAAC
ATGTCGCTGC AGTGCGTGGA CTTCGCCCGG CCGCGTGAGG CGGTCGACCT GGCGGAGGCC
GCGGTCGCCA GCGCGCGGCG GGCGACCGGC CGGGTCGCGG CGATGCTGCA GATGCGCCGG
GCCCGCGCGC ACGCCGCGCT GGGCGAGGCC AGCGCCTGCG CCCAGGCGCT GGCCTGCGCC
GAGGCGGCGT TCGTCGAGGC ACGCGCGGAG GACCCGGCCT GGTCGGCCTA CTTCGACGAC
GCCGAGTACC AGGCGCAGGT CGGCAGCTGC TACATCGACC TCGGCCACCT CGTGCACGCC
GATCGCTGGC TCGAGGGCTC GCTGGCCATC CACCCGCACG AGCGCACCCG GGACCGCGCG
ACCTACCTGT TGCGGCGGGC CGCCGTCCAG ATCGACCTGG GCAACCTCGA CGGCGGGTGC
TCGCTGGCGA AGGAGGCCCT GCCGATGCTG GAGGCGACCC GGTCGAAGCG GAACAGCCGG
CGTGCCGACG AGGTCCGGCG GCGGCTGCGC CGGCACTCGT CGGACCCGGC CGCGCGTGAG
CTCGACCAGG TACTGGCCCG CACGGCCTGA
 
Protein sequence
MLTDNAHACT HPLAFVRAQR GWSYQRLARV VARRARDLGV ANMAAERQKV WRWEHRGVVP 
DRVSQLALAA ELGVPTDRLE SHPWPSWLPT GDAVRTEYPW TASGSVTSLM DVVEDALTDR
RGFLTITGPG VASLSSEWLG LEPARLQVAL AGGQVDEQIV NRIEHNIPGL RVMDERLGGE
SVRRLVDAEL GVVADLLARG SYTEAIGRHL HLVAAELARF AGWVSFDAGF QTAAQRYWVT
ALHAAHAAGD RMLGANVLKN MSLQCVDFAR PREAVDLAEA AVASARRATG RVAAMLQMRR
ARAHAALGEA SACAQALACA EAAFVEARAE DPAWSAYFDD AEYQAQVGSC YIDLGHLVHA
DRWLEGSLAI HPHERTRDRA TYLLRRAAVQ IDLGNLDGGC SLAKEALPML EATRSKRNSR
RADEVRRRLR RHSSDPAARE LDQVLARTA