Gene Franean1_3976 details

Gene Information

Locus tag: Franean1_3976
Symbol:
ID: 5672337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4761380
End bp: 4762663
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 74%
IMG OID: 641242855
Product: hypothetical protein
Protein accession: YP_001508272
Protein GI: 158315764
COG category:
COG ID:
TIGRFAM ID: [TIGR02678] conserved hypothetical protein TIGR02678
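
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not translated. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 4761380
end_bp = 4762663
gene_length_bp = 1284
protein_length_aa = 427

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that yields no residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1284 0 427
```

Under these assumptions the reported gene length (1284 bp) and protein length (427 aa) agree with the coordinates.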


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.667235
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.133993
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCACCC AGCGCCGCAA CATCCCGCGC GGGTCGCGTC CCGGGTCGTC GTCGACAGCG 
GTCATCGACG CACTTGACGC GCAGCGCGCC GCCGAGCGCC GCCGGGCCAT GCGCGCGATC
CTGCGCCGCC CGCTGCTCGT CGCGCACGGT CCCGACGCCG ACGCCTTCCG CCTCGTCCGC
CGGCACCAGA CGTGGCTGCG GGACTGGTTC ACCCGAGAGA CGGGCTGGTC GCTGCGGGTC
GACCCGGAGG TGGCCCGACT GGCCAAGATC CCGGCCGACC TGACCGACGG CACCCGCCCG
GCCACGGCAG GATCGGCCCA GCAGCCGTTC GGCCGCCGCC GCTACGTCCT GCTCTGCCTG
GCCCTGGCCG GCCTGGAACG GGCCGACAAC CAGATCACCC TGGGCAGCCT GGCCGACGAC
GTGATGATGG GCTGCGCCGC CCCTGAGCTC GCCGAGGCGG GCGTCAGCTT CAGCCTCGAC
AGCCGGGACG AGCGCGCCGA CCTGGTGGCG GCCGTCCGCG TTCTCCTTGA CCTCGGTGTG
CTGCGCCGGG TCGCCGGTGA CGAGACGACC TTCACCACCG GCACCGGCGA CGCCCTCTAC
GACCTCGACC GCCGGGCGCT GGCCGGCATG CTGGTGACCC GGCGCGGCCC CTCGACAGTC
CGCGACATCC CAGGCCCGGC CGATGTGGAA GGCCGGCTGG CCGCCGTCGT CGAGGAACTG
ACGGCCGACA CCGACGACGC GCGCAACCTG GCCCGTCGCC ACGCGCTGAC CCGGCGCCTG
CTGGACGACC CGGTCGTCTA CTACGTGGAT CTGGACGAGG GCGAGCGGGC CTACCTGACC
AGCCAGCGGG CCGTGCTGAC CCGGCGGATC ACCGAGGCGA CGGGCCTGGT CGCAGAGGTC
CGCGCGGAGG GGATCGCGAT GGTCGACCCC GACGGCGACC TCACCGACAC CCGAATGCCC
GAGGACGGCA CCGATGGCCA CGCCACCCTC CTGCTCGCCG AGCATCTCGC CCGCGAGGGC
ACCCGTCTCG GCCCAGGTGA GCCGATAGCT GTCGCCGACC TCGACGCCCA CATGCGCGAG
CTGATCGCCC AGCACCAGAA GCACTGGCGC AAGGGCGTCA CCGAGCCCGA CGCGGAGGCC
GAGCTGGTCG ACCGGGCGCT GTCGCGGATG CGAGCCCTCG GCCTGCTGCG CCGGCGCGGG
GACGACGTGT TCGCCCTGCC GGCGCTCGCC CGGTTCGCGC TCGGCGATCT GCGGGACGGC
GGCGGCCAGG AGTCACTGGC ATGA
 
Protein sequence
MTTQRRNIPR GSRPGSSSTA VIDALDAQRA AERRRAMRAI LRRPLLVAHG PDADAFRLVR 
RHQTWLRDWF TRETGWSLRV DPEVARLAKI PADLTDGTRP ATAGSAQQPF GRRRYVLLCL
ALAGLERADN QITLGSLADD VMMGCAAPEL AEAGVSFSLD SRDERADLVA AVRVLLDLGV
LRRVAGDETT FTTGTGDALY DLDRRALAGM LVTRRGPSTV RDIPGPADVE GRLAAVVEEL
TADTDDARNL ARRHALTRRL LDDPVVYYVD LDEGERAYLT SQRAVLTRRI TEATGLVAEV
RAEGIAMVDP DGDLTDTRMP EDGTDGHATL LLAEHLAREG TRLGPGEPIA VADLDAHMRE
LIAQHQKHWR KGVTEPDAEA ELVDRALSRM RALGLLRRRG DDVFALPALA RFALGDLRDG
GGQESLA
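
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. This is a hedged example, not the pipeline that produced the record. It assumes NCBI translation table 11, whose amino-acid assignments match the standard code and which allows alternative start codons such as GTG, read as methionine at the first position. The function name `translate` and its `first_is_start` parameter are hypothetical. Only the first 60 and last 24 nucleotides of the gene sequence are used.

```python
# Codon table in NCBI order (T, C, A, G); amino-acid assignments of table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons permitted by table 11; at position 0 they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, first_is_start=True):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence (60 nt) and its last line (24 nt).
first_60 = "GTGACCACCC AGCGCCGCAA CATCCCGCGC GGGTCGCGTC CCGGGTCGTC GTCGACAGCG"
last_24 = "GGCGGCCAGG AGTCACTGGC ATGA"

print(translate(first_60))                       # MTTQRRNIPRGSRPGSSSTA
print(translate(last_24, first_is_start=False))  # GGQESLA
```

The two fragments reproduce the first 20 and last 7 residues of the protein sequence above, including the GTG start codon being read as M and the terminal TGA being read as a stop.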