Gene Franean1_1962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1962 
Symbol 
ID5670363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2358374 
End bp2359366 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content76% 
IMG OID641240883 
Producthelix-turn-helix type 11 domain-containing protein 
Protein accessionYP_001506305 
Protein GI158313797 
COG category[K] Transcription 
COG ID[COG2378] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000800393 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTCCA GCCGGCTGAT GGCTCTGCTG CTCCATCTGC AGGCGCACGG CCGGGCCACG 
GCGGGGGAGC TCGCCGCCCA GTTCGAGGTG TCCGTCCGCA CCGTGCGCCG GGACGTGGCC
GCGCTCGCCG AAGCGGGTGT CCCACTGTGG TCTGAGCCCG GCCCGCACGG CGGGATCCGT
CTCGTCGAGG GCTGGCGGAC GAACCTGGAC GGGCTGACCG GCGACGAGGC GTCCGCGCTG
CTCATCGCCG GGGCGGGCGG GGACGTGCTC GGCGGCCTCG GCCTCGAGAC GGTCGCCGCG
GCCGCGCAGA CCAAGATCCT CGCGACGCTG CCGCCGGAGC TGCGGGCACG GGCGGGCCGG
GTCCGGGAAC GCTTCCACCT CGACGCGCCG GGCTGGTTCG GCTCCGAGGA GCCCGTGCCG
CACCTCGCGG TCGTCGCGGG CGCGGTCTGG TCCGGGCAGC GGATCACCGT CTGTTACGGG
CGGCCCGACC GGACGGTGGA GCGTTCTCTC GAGCCGCTGG GCCTCGTTCT CAAGGCCGGT
GTCTGGTATC TCGTGGCCCG CGGCGGGTCC GCCGTCCGCA GCTACCGGAT CGGCCGGATC
GTCGAGGCGG CGGTCCGGAG CGGGCCGGAG GGCCGCTTCA CCCGGCCCGC CGACTTCCAC
CTGGCGCGGT GGTGGGCGTC GTCGAACGAG GACTTCGCGC GCTCGCTGCT GCGCTGGCCG
GCGCGGCTGT GGCTGTCCCC GCGAGGCCTG CGGAGCCTGC CCGGAGTGCT CGGCCCGCTG
GCCGGCCAGC GGGCGCTGGC CACGGCCGGC GAGCCCGACG CGGATGGCTG GCGGGAGGTG
GAGGTCTGGT TCGAGGGCCC CGATGTCGCC GAGAGCCAGC TCTGGGCATT CGGCCCGCAC
GTGCGGGTGC TCGCCCCCGA CTCCCTGCGC GAGGCCCTCG CGCGGACGGC ACAGCAGGCG
GCGGCGAACA ACGGCGCTCC GAAGTCCGGC TGA
 
Protein sequence
MRSSRLMALL LHLQAHGRAT AGELAAQFEV SVRTVRRDVA ALAEAGVPLW SEPGPHGGIR 
LVEGWRTNLD GLTGDEASAL LIAGAGGDVL GGLGLETVAA AAQTKILATL PPELRARAGR
VRERFHLDAP GWFGSEEPVP HLAVVAGAVW SGQRITVCYG RPDRTVERSL EPLGLVLKAG
VWYLVARGGS AVRSYRIGRI VEAAVRSGPE GRFTRPADFH LARWWASSNE DFARSLLRWP
ARLWLSPRGL RSLPGVLGPL AGQRALATAG EPDADGWREV EVWFEGPDVA ESQLWAFGPH
VRVLAPDSLR EALARTAQQA AANNGAPKSG