Gene Franean1_4355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4355 
Symbol 
ID5672710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5199243 
End bp5200313 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content71% 
IMG OID641243228 
ProductAraC family transcriptional regulator 
Protein accessionYP_001508645 
Protein GI158316137 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCGTCT TCCGCTCGGC GGGCCCGCGC GGTTCGCGGG TACCTGTCTC ACCCTGCCGC 
CCGCCGGAAA CATACGGTAG TGGCCTGAAT GCCACATATC CCAAGGATCA GGCCATGCAT
CGGATCGTCG TCGTCGCCGT CCCGCCGGTC ACCACCCTTG ATCTGTCCAT CCCGGCGGCA
GTGTTCCCGG CCGCGGTGGT CCACAGCCAG CCGGCCTACG AGGTCGTGAT CTGCACGGCC
GAGCCCGGGA TCGTACCTGG GTACACCGGG CCCAGCGTTG TGGTGGACCG GGGCCTCGAC
GTGATCGACA GCGCCGACAC CGTGATCGTC ACGGGAACCG GAGCTCGCGC CCACGCCGAC
CAACGGGTCC TGGACGCGCT ACAGCGGGCC GCTGACGACG GCCGGCGCAT CGCCTCGATC
TGCACGGGCG CCTTCGTGCT GGCCCAGGCC GGGCTGCTCA ACGGTCGCCC GGCCACCACG
TACTGGCAGT ACTCCCAGGA GATGCGCCGC CGCTTCCCAG CCGTCGACCT GCGGCCCGAC
GTCCTGTACG TCGACGACGG GACCGTGCTG ACCTCCGCCG GCCTGGCCGC CGGTCTCGAC
CTGTGCATCC ACATGATCCG GCGCGACCAC GGGGCGGTGG TCGCCAACGC CGTCGCCCGA
GCCGCGGTCA TCGCGCCGAT CCGTCCCGGC GGCCAGGCCC AGTTCATCGA GACACCGCTG
CCACCGGAGA ACGGGACCTC GCTGGCCCAG ACCCGCGCCT GGGCGGCGGA GCACCTCGCC
GAACCGCTGA CACTCGCCCG CCTCGCCGCC CACGCCCACA CCAGTACCCG CACGCTCACC
CGCCGCTTCC GGGAGGAGAC CGGTCTCAGC CCACTGCAAT GGTTGCTGCA CCAGCGAATC
GACCGGGCTC GGGAACTCCT CGAGGCGACG GATCTGCCGA TCACCGCCGT CGCCCGGCAA
AGCGGCCTGG GAACCCCCGA GTCGCTGCGC CTCCACCTCC TGCGCCGCAA CGGGCTCACC
CCCAGCGCCT ACCGCGACAC ATTCACCCGC GTCGGACCAA CCCCGACCTG A
 
Protein sequence
MFVFRSAGPR GSRVPVSPCR PPETYGSGLN ATYPKDQAMH RIVVVAVPPV TTLDLSIPAA 
VFPAAVVHSQ PAYEVVICTA EPGIVPGYTG PSVVVDRGLD VIDSADTVIV TGTGARAHAD
QRVLDALQRA ADDGRRIASI CTGAFVLAQA GLLNGRPATT YWQYSQEMRR RFPAVDLRPD
VLYVDDGTVL TSAGLAAGLD LCIHMIRRDH GAVVANAVAR AAVIAPIRPG GQAQFIETPL
PPENGTSLAQ TRAWAAEHLA EPLTLARLAA HAHTSTRTLT RRFREETGLS PLQWLLHQRI
DRARELLEAT DLPITAVARQ SGLGTPESLR LHLLRRNGLT PSAYRDTFTR VGPTPT