Gene Franean1_1330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1330 
Symbol 
ID5669741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1598991 
End bp1600205 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content67% 
IMG OID641240261 
ProductXRE family transcriptional regulator 
Protein accessionYP_001505688 
Protein GI158313180 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGACG ACCAGATCCG CGGTGTGGGC GAGCGCATCG CCGATGCACG CAAGGCCCGC 
AGCTTCAGTC AACGCCAGCT GGCCGAACAC GCTCACGTGA GCCTCTCGCT GCTCCGCAAG
GTCGAGCAGG GCAGCCGAGA TGCCACGCCA GCACTCATCG CAGCGGTCGC ACGGGCACTG
ACCATCGACG TGACCGCCCT GACCGGCCAG CCCTACGATC TGGGCGGCCG GCAACTCGAT
CCGCTTCACC AGCACATCCC CAGGCTCCGA CGTGCGCTCA CATACTGGGA CCTGCCGCCC
GAGGGCGTCA CGCCACGTTC CCCGGCAGAG CTGGTCCGAG ACGCCGACCG CGCCGCGGAC
CTTCGCCGAT CCGGCAGCCA CGTCCAACTC GCTGCCGTAC TTCCGGCCCT GCTCACGGAG
ACCACCGCTG CTATCCACAG CGCTCCGGCC GGACCAGAGC GTGAGCGTGC CTACGCCACC
CTGACCGTGC TGCTCTTCGC CGCGCACTCC GTCACCTACA AGACGGGATA CATCGATCTT
TCCACCCTCA TCGAGGAACG CACCCATTGG GCGGCGCTCG CGTCCGCTGA CCCGGTACTC
GGGGCGCTCG CGGCGTGGAC ACGTACCACG TCCCTGCTCC AGGCTGGTTC TTACGACATC
GGCCTGCAGC TCCTGGACCG CGCCCAGGCG GAAATTCCCG CCGGCCCCGA ACCGGACGAC
AGCACCCTGC GGATGTCCGG AGCACTGCAT CTGCGTGCCG CGATGCTCGC GGCACGCAGT
GGAGACTCCG ATCTGACCAA CGATCACCTT GCCGCTGCGC GCCGGCTGTC TACGCGGCTC
GGCGATATCG ACCACGACGG CGGCCGTTAC CAGCTTGCCT TCGGCCCGGC CAATACCGGC
GTGCACGTCG TCGCCGCTGC CGTCGAGCTG GGCGACGGCG ACGAAGCGAT CAAACAAGCC
AGCCAGGTGC ACATCTCGAC CGGCCTGCCG AAGATCCGCG CCTGCCACCA CTACATTGAC
CTGGCCCGCG CCTACCTGTG GACCGGAAGA AAAGAGGATT CTCTACGCTG CCTGACGACC
GCGCGCGAGA TCGCTCCGCA GCAGACCCGT CATCACCCCA CGACCCGCGA GGTCGTGCGG
ATGCTCATAC GTCTACACCA CCGCAGCAAT ACGCAACTCA CGAAGATGGC AGGCTGGATC
GGACACGAAT CCTGA
 
Protein sequence
MDDDQIRGVG ERIADARKAR SFSQRQLAEH AHVSLSLLRK VEQGSRDATP ALIAAVARAL 
TIDVTALTGQ PYDLGGRQLD PLHQHIPRLR RALTYWDLPP EGVTPRSPAE LVRDADRAAD
LRRSGSHVQL AAVLPALLTE TTAAIHSAPA GPERERAYAT LTVLLFAAHS VTYKTGYIDL
STLIEERTHW AALASADPVL GALAAWTRTT SLLQAGSYDI GLQLLDRAQA EIPAGPEPDD
STLRMSGALH LRAAMLAARS GDSDLTNDHL AAARRLSTRL GDIDHDGGRY QLAFGPANTG
VHVVAAAVEL GDGDEAIKQA SQVHISTGLP KIRACHHYID LARAYLWTGR KEDSLRCLTT
AREIAPQQTR HHPTTREVVR MLIRLHHRSN TQLTKMAGWI GHES