Gene Franean1_0402 details

Gene Information

Locus tag: Franean1_0402
Symbol:
ID: 5668826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 480893
End bp: 481999
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 68%
IMG OID: 641239335
Product: integrase catalytic region
Protein accession: YP_001504774
Protein GI: 158312266
COG category:
COG ID:
TIGRFAM ID:
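
The coordinates and lengths listed above are consistent with each other if Start bp and End bp are read as 1-based inclusive positions and the final codon is a stop codon. A minimal Python sketch of that arithmetic follows. The function names are illustrative, and the coordinate convention is an assumption rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(480893, 481999)
print(length, protein_length(length))  # prints: 1107 368
```

Under those assumptions the computed values match the Gene Length (1107 bp) and Protein Length (368 aa) fields of the record.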


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.591562
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCAGCGG GGCGGGACCC GTGTCGGGCA GGCTGGATGG TCATGGTGTG GTCGCTGCTC 
TACGCCCTGA CACGCAACAC TCTCGGGCTG ATGCTGCTCC GCGTGCGCGG GGATGCTGCG
AAGGACGTCG AACTCCTCGT CCTGCGGCAC CAGGTGGCGG TGTTGCGACG GCGGGTCCAT
CGTCCGGCAT TGGAACCGGC GGATCGGGTG ATTCTCGCAG CCCTGTCCCG GCTGCTACCC
CGGGCCAGTT GGGACATCTT CTTCGTCACC CCGGCCACCG TGCTGCGCTG GCACCGTGAG
CTCCTCGCAC GAAAATGGAC TTACCCGCGC AAGACGCACG GACGGCCGCC GATCCGCCGG
GAGATCCGTG AGCTGGTTCT GCGTCTCGCG CGGGAGAACC CGACCTGGGG CCACCGCAGG
ATCCAGGGCG AGCTCGTCGG GTTGGGTTAC TCGGTCGGGG TCGCCACCGT CTGGCGGATT
CTGCACCGCG CCGGCGTCGA CCCCGCACCT CGGCGGGCCG ACACCTCCTG GCGCACGTTC
CTACGCGCCC AGGCCTCCGG CATCCTGGCC TGCGACTTCT TCACCGTGGA CACCGTGTTC
CTCCAACGGA TCTACGTGTT CTTCGTCGTC GAACACGCCA CCCGCCGTGT TCATGTCCTC
GGGGTCACGA AGCACCCGAC CACGGCGTGG GTCACCCAGC AGGCACGGAA CCTGCTAATA
GATCTCGAGG AGCGCAGCCA CCGGTTCCGG TTCCTTCTCC GTGACCGTGA CACGAAGTTC
ACGTCCTCGT TCGACGCTGT CTTCACTGGG GCCGGTATCG ACGTGGTGCG CACACCACCG
CAAGCCCCGC AGGCGAACGC GATCGCGGAA CGCTGGGTCG GCACCGCCCG CCGGGAATGT
ACCGACAGGC TGTTGATCGT CTCCGAACGG CACCTGACGT CAGTCCTCGA CAGCTACGCC
GAGCATTTCA ACACCCACCG GCCCCACCGC TCCCTCGGCC AGCACCCACC CGACTCGCCA
CCCGTGGTCG CCCCGACGTC GGAGTCCACC GTCCGTCGCA CACGCATCCT CGGCGGGCTG
ATCAACGAGT ACCGCAACGC CGCCTGA
 
Protein sequence
MSAGRDPCRA GWMVMVWSLL YALTRNTLGL MLLRVRGDAA KDVELLVLRH QVAVLRRRVH 
RPALEPADRV ILAALSRLLP RASWDIFFVT PATVLRWHRE LLARKWTYPR KTHGRPPIRR
EIRELVLRLA RENPTWGHRR IQGELVGLGY SVGVATVWRI LHRAGVDPAP RRADTSWRTF
LRAQASGILA CDFFTVDTVF LQRIYVFFVV EHATRRVHVL GVTKHPTTAW VTQQARNLLI
DLEERSHRFR FLLRDRDTKF TSSFDAVFTG AGIDVVRTPP QAPQANAIAE RWVGTARREC
TDRLLIVSER HLTSVLDSYA EHFNTHRPHR SLGQHPPDSP PVVAPTSEST VRRTRILGGL
INEYRNAA
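
The gene sequence begins with TTG, while the protein sequence begins with M. This fits translation table 11, which uses the standard amino acid assignments but allows several alternative start codons that are read as methionine in the first position. The Python sketch below shows one way to reproduce the protein from the gene sequence. All names in it (`CODON_TABLE`, `START_CODONS`, `translate_cds`) are illustrative, and the start codon set is written from the NCBI table 11 definition as an assumption to verify before reuse.

```python
# Standard amino acid assignments, codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons assumed for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, with the 10-base grouping kept as listed.
print(translate_cds("TTGTCAGCGG GGCGGGACCC GTGTCGGGCA"))  # prints: MSAGRDPCRA
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence through the same function is a quick way to check the rest of the record.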