Gene Franean1_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1404 
Symbol 
ID5669810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1700587 
End bp1701693 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content68% 
IMG OID641240327 
Productintegrase catalytic region 
Protein accessionYP_001505754 
Protein GI158313246 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.731191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTGCCG GACTGGGCCT GTGTCGGGCA GGCTGGATGG TCATGGTGTG GTCGCTGCTC 
TACGCCCTGA CACGCAACGC TCTCGGACTG ATGTTGCTCC ACGTGCGCGG CGACACCGCG
AAAGACGTAG AGCTCCTCGT CCTGCGACAT CAGGTGGCGG TGTTACGACG GCAGGTGAAC
CGTCCGACGC TGGAACCGGC GGATCGCGTC ATCCTCGCAG CCCTGTCCCG GCTGCTACCC
CGGGCCCGCT GGGGTTCGTT CGTCGTCACC CCGGCCACCG TGCTGCGCTG GCACCGTGAG
CTCCTCGCAC GCAAATGGAC CTACCCACGC AAGACCCCCG GACGGCCACC GGTCCGCCGG
GAGATCCGCG ATCTGGCCCT GCGCCTCGCG CAGGAAAATC CGACCTGGGG CCACCGCCGG
ATCCACGGCG AACTCGCCGG GCTGGGCTAC CCGGTCGGGG TCGCCACCGT CTGGCGGATC
CTGCACCGCG CCGGCGTCGA CCCCGCACCC CGACAGGCCG ACACCTCCTG GCGCACGTTC
CTGCCCGCGC AGGCCTCCGG CCTGCTGGCC TGCGATTTCT TCACCGTGGA CACCGTCTTC
CTGCAACGGA TCTACGTGTT CTTCGTCGTC GAACACGCCA CCCGCCACGT TCATGTCCTC
GGGGTCACGA AGCATCCGAC CGCGGCGTGG GTCACTCAGC AGGCACGGAA CCTGCTGATG
GATCTCGACG AGCGTGGCCA CCGGTTCCGG TTCCTCATCC GTGACCGCGA CACGAAGTTC
ACGGCTTCCT TCGACGCTGT CTTCGCCGGG GCTGGTATCG ACGTGGTACG CACACCACCG
CAGTCGCCGC AGGCGAACGT GATCACGGAA CGCTGGGTCG GCACCGCCCG CCGGGAATGC
ACCGACAGGC TGCTGATCGT CTCCGAACGG CACCTGACAT CGACCCTCAC CAGCTACGCG
AAGCATTTCA ACACCCACCG GCCTCACCGC TCCCTCGGCC AGCACCCACC CGACCCGCCA
CCCGTGCTCG CCCCGACGCC GGAGTCCACC GTCCGTCGCA CCCGCATCCT CGGCGGGCTG
ATCAGCGAAT ATCGCAACGC CGCCTAA
 
Protein sequence
MPAGLGLCRA GWMVMVWSLL YALTRNALGL MLLHVRGDTA KDVELLVLRH QVAVLRRQVN 
RPTLEPADRV ILAALSRLLP RARWGSFVVT PATVLRWHRE LLARKWTYPR KTPGRPPVRR
EIRDLALRLA QENPTWGHRR IHGELAGLGY PVGVATVWRI LHRAGVDPAP RQADTSWRTF
LPAQASGLLA CDFFTVDTVF LQRIYVFFVV EHATRHVHVL GVTKHPTAAW VTQQARNLLM
DLDERGHRFR FLIRDRDTKF TASFDAVFAG AGIDVVRTPP QSPQANVITE RWVGTARREC
TDRLLIVSER HLTSTLTSYA KHFNTHRPHR SLGQHPPDPP PVLAPTPEST VRRTRILGGL
ISEYRNAA