Gene Franean1_6880 details

Gene Information

Locus tag: Franean1_6880
Symbol:
ID: 5675193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8384567
End bp: 8386027
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 66%
IMG OID: 641245729
Product: nitrogenase molybdenum-iron protein alpha chain
Protein accession: YP_001511120
Protein GI: 158318612
COG category: [C] Energy production and conversion
COG ID: [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains
TIGRFAM ID: [TIGR01282] nitrogenase molybdenum-iron protein alpha chain
            [TIGR01862] nitrogenase component I, alpha chain
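
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(8384567, 8386027)
print(length)                              # 1461
print(expected_protein_length_aa(length))  # 486
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.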


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.782368
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGA CTCCGGCTCC GCAGCGGGCC GAGACCGAGG CGATGATCGC CGAGGTCCTG 
GAGCAGTACC CCCAGAAGGC GGCCAAGTTC CGCGCCAAGC ACCTCAAGGC CAACGACCCC
GAGGGGTCGA AGGAGTGCGA GGTCAAGTCC AACATCAAGT CCCGCCCCGG GGTCATGACG
ATCCGTGGCT GTGCCTACGC CGGTTCCAAG GGCGTGGTGT GGGGCCCGGT GAAGGACCTG
GTCACGATCA GCCACGGCCC GGTCGGCTGC GGCCAGTACT CCTGGGCCAC CCGGCGCAAC
TACGCCCGCG GTCCGTGGGG TGTCACCAAC TTCACCGCGA TGCAGATCAC CACGGACTTC
CAGGAGAAGG ACATCGTCTT CGGCGGTGAC CCCAAGCTGG AGCAGGTCTG CGACGAGATC
AACGAGCTCT TCCCGCTGGC CAAGGGCATC TCGGTCCAGT CCGAGTGCCC GATCGGTCTG
ATCGGCGACG ACATCGAGGC GGTTTCCCGC ACGGCGTCGA AGAAGCTTGG CAAGCCGGTC
ATCCCGGTCC GCTGTGAGGG CTTCCGCGGG GTGAGCCAGT CGCTCGGCCA CCACATCGCC
AACGACGCGG TCCGCGACCA CGTTCTCGGC ACCGGCGGCG AGTCCTTCAA GGAGACCCCG
TACGACGTCG CGCTCATCGG CGACTACAAC ATCGGCGGCG ACGCCTGGGC GTCCCGGCGG
ATCCTCGAGG ACATGGGCCT GCGCGTCATC GCCCAGTGGT CCGGCGACGG CACGCTGAAC
GAGATGGCCT CGACGCACCT GTCGAAGCTG AACCTGATCC ACTGCTACCG CTCCATGAAC
TACATCTGCA CCACCATGGA GGAGCGTTAC GGCACTCCGT GGATCGAGTT CAACTTCTTC
GGGCCGACCA AGATCGTCAA CTCGATGCGG GCCATCGCGG CCAAGTTCGA CGAGACGATC
CAGGCGAAGA CCGAGGCCGC GATCGCGCGT TACCAGAAGC GCTTCGACGA GATCACGGCG
GCGTTCAAGC CGCGCCTCGA CGGCAAGCGC GTCATGCTCG CTGTCGGCGG CCTGCGTCCC
CGTCACACCA TCGGCGCCTA CGAGGACCTC GGCATGGAGG TCGTCGGCAC CGGTTACGAG
TTCGCGCACA AGGACGACTA CACCCGCACG TACGCCGAGC TCAAGGAAGG CGTCGTGCTC
TACGACGACC CGACGGCGTT CGAGCTGGAG GAGTTCGCCA AGCGGCTCAA GCCGGACCTC
ATGGGTGCGG GCGTCAAGGA GAAGTACGTC TTCCACAAGA TGGGCATCCC CTTCCGCCAG
ATGCACTCCT GGGACTACTC CGGGCCGTAC CACGGCGTCG ACGGCTTCGC GGTCTTCGCC
CGCGACATGG ACATCGCGAT CAACAGCCCG ACCTGGGACC TCATGGAGAC CCCCTGGTCG
AAGGCCGGAG AGGTGTTCTG A
 
Protein sequence
MTTTPAPQRA ETEAMIAEVL EQYPQKAAKF RAKHLKANDP EGSKECEVKS NIKSRPGVMT 
IRGCAYAGSK GVVWGPVKDL VTISHGPVGC GQYSWATRRN YARGPWGVTN FTAMQITTDF
QEKDIVFGGD PKLEQVCDEI NELFPLAKGI SVQSECPIGL IGDDIEAVSR TASKKLGKPV
IPVRCEGFRG VSQSLGHHIA NDAVRDHVLG TGGESFKETP YDVALIGDYN IGGDAWASRR
ILEDMGLRVI AQWSGDGTLN EMASTHLSKL NLIHCYRSMN YICTTMEERY GTPWIEFNFF
GPTKIVNSMR AIAAKFDETI QAKTEAAIAR YQKRFDEITA AFKPRLDGKR VMLAVGGLRP
RHTIGAYEDL GMEVVGTGYE FAHKDDYTRT YAELKEGVVL YDDPTAFELE EFAKRLKPDL
MGAGVKEKYV FHKMGIPFRQ MHSWDYSGPY HGVDGFAVFA RDMDIAINSP TWDLMETPWS
KAGEVF
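
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the sequence contains only unambiguous bases.

```python
def normalize_sequence(blocked_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (assumes only A, C, G, T)."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First displayed row of the gene sequence, copied from the record above.
first_row = "ATGACGACGA CTCCGGCTCC GCAGCGGGCC GAGACCGAGG CGATGATCGC CGAGGTCCTG"
seq = normalize_sequence(first_row)
print(len(seq))  # 60
print(f"GC fraction of first row: {gc_fraction(seq):.2f}")
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the GC content field of the record.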