Gene Franean1_3414 details


Gene Information

Locus tag: Franean1_3414
Symbol:
ID: 5671785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4044349
End bp: 4045455
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 68%
IMG OID: 641242302
Product: integrase catalytic region
Protein accession: YP_001507722
Protein GI: 158315214
COG category:
COG ID:
TIGRFAM ID:
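
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4044349, 4045455)
print(length)                  # 1107
print(protein_length(length))  # 368
```

Both values agree with the Gene Length and Protein Length fields of this record.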


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCTGCCG GACTGGGCCT GTGTCGGGCA GGCTGGATGG TCATGGTGTG GTCGCTGCTC 
TACGCCCTGA CACGCAACGC TCTCGGACTG ATGTTGCTCC ACGTGCGCGG CGACACCGCG
AAAGACGTCG AGCTCCTCGT CCTACGACAT CAGGTGGCGG TGTTACGACG GCAGGTGAAC
CGTCCGACGC TGGAACCGGC GGATCGGGTG ATCCTCGCGG CGCTGTCCCG GCTGCTGCCC
CGGGCTCGCT GGGGTTCGTT CTTCGTCACC CCGGCCACCG TGTTGCGCTG GCACCGGGAA
TTCCTCGCAC GAAAATGGAC CTATCCCCGC AAGACACCCG GGCGGCCGCC GGTCCGCAGG
GAGATCCGCG AGCTGGTCCT GCGCCTCGCG CAGGAAAATC CGACCTGGGG CCACCGCCGG
ATCCAAGGCG AACTCGTCGG GCTGGGCTAC CCGGTCGGGG TCGCCACCGT CTGGCGGATC
CTGCACCGCG CCGGCATCGA CCCCGCGCCC CGGCGGGCCG ACACCTCTTG GCGTACGTTC
CTGCGCGCCC AGGCCTCTGG CCTGCTGGCC TGCGACTTCT TCACGGTGGA CACCGTGTTC
CTCCAGCGGA TCTACGTGTT CTTCGTCGTC GAACACGCCA CCCGCCGTGT TCACGTCCTC
GGGGCCACGA AGCACCCGAC CTCGGCGTGG GTCACCCAGC GGGCACGGAA CCTGCTGATG
GATCTCGACG AGCGCAGCCA CCGCTTCCGA TTCCTGATCC GTGACCGCGA CACGAAGTTC
ACGGCTTCCT TCGACGCTGT CTTCGCTGGT GCCGGCATCG ACGTGGTACG CACACCACCG
CAAGCCCCGA CGGCGAACGC GATCGCGGAA CGCTGGGTCG GCACCGCCCG CCGGGAATGC
ACCGACAGAT TATTGATCGT CTCCGAACGG CACCTGACGT CAGTCCTCGG CAGCTACGCC
GAGCATTTCA ACACCCACCG ACCCCACCGC TCCCTCGGCC AGCACCCACC CGACCCGCCG
CCCATGGTCA CCCCGACCTC GGATTCCACC GTCCGTCGCA CCCGCATCCT CGGCGGGCTG
ATCAACGAGT ACCGCAACGC CGCCTGA
 
Protein sequence
MPAGLGLCRA GWMVMVWSLL YALTRNALGL MLLHVRGDTA KDVELLVLRH QVAVLRRQVN 
RPTLEPADRV ILAALSRLLP RARWGSFFVT PATVLRWHRE FLARKWTYPR KTPGRPPVRR
EIRELVLRLA QENPTWGHRR IQGELVGLGY PVGVATVWRI LHRAGIDPAP RRADTSWRTF
LRAQASGLLA CDFFTVDTVF LQRIYVFFVV EHATRRVHVL GATKHPTSAW VTQRARNLLM
DLDERSHRFR FLIRDRDTKF TASFDAVFAG AGIDVVRTPP QAPTANAIAE RWVGTARREC
TDRLLIVSER HLTSVLGSYA EHFNTHRPHR SLGQHPPDPP PMVTPTSDST VRRTRILGGL
INEYRNAA
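
The protein sequence starts with M although the gene sequence starts with TTG, which normally encodes leucine. Under translation table 11 (bacterial), TTG is one of the accepted initiation codons, and an initiation codon is read as methionine. The following is a minimal sketch of that translation step, not the pipeline used to produce this record. It assumes the input is a complete CDS whose first codon is a valid start codon (it does not check this), and the name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon assignments in NCBI ordering (bases T, C, A, G). Table 11 uses the
# same codon-to-amino-acid assignments as the standard code; it differs in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon.

    Assumption: the first codon is a valid initiator; this is not verified.
    """
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    protein = ["M"]  # initiation codon (here TTG) is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above
print(translate_cds("TTGCCTGCCG GACTGGGCCT GTGTCGGGCA"))  # MPAGLGLCRA
```

The printed residues match the first ten residues of the protein sequence listed above.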